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SECTION  1 

EXECUTIVE  SUMMARY 


This  final  report  describes  research  in  optical  neural  networks  performed  at  Hughes  Aircraft 
Company  under  a  three-year  DARPA  sponsored  contract.  The  objective  of  demonstrating  a 
programmable  optical  computer  for  flexible  implementation  of  neural  network  models  was 
successfully  achieved.  The  advantages  of  optics  for  neural  network  implementations  include  high 
storage  capacity,  connectivity,  and  very  fine-grained  parallelism  which  results  in  high  computation 
rates.  The  optical  neurocomputer  developed  under  this  program  is  based  on  a  new  type  of 
holography,  cascaded  grating  holography,  in  which  the  neural  network  weights  are  distributed 
among  cascaded  angularly-  and  spatially-multiplexed  gratings.  This  approach  reduces  crosstalk 
and  improves  the  utilization  of  the  optical  input  device.  Successfully  implemented  neural  networks 
include  the  Perceptron,  Bidirectional  Associative  Memory,  and  backpropagation  neural  networks. 
Up  to  10^  neurons,  2x10^  weights,  and  processing  rates  of  2x10^  connection  updates  per  second 
were  achieved  on  this  program. 

The  organization  of  this  final  repon  is  as  follows.  First,  we  briefly  describe  the  nature  of 
neural  network  models  and  the  types  of  problems  they  are  meant  to  address.  We  describe  real-time 
holography  and  its  advantages  and  disadvantages  for  neural  network  implementations  in  terms  of 
storage  capacity,  connectivity,  and  parallel  processing.  A  new  holographic  technique,  cascaded 
grating  holography  (CGH),  was  developed  by  us  to  overcome  an  important  source  of  distortion  in 
holographic  neural  networks.  We  demonstrated  CGH  in  photorefractive  BaTi03  crystals  using 
both  visible  light  (514  nm)  from  an  argon  laser  and  infrared  light  (830  nm)  from  a  laser  diode.  The 
infrared  experiments  arc  especially  significant  because  very  compact  systems  can  be  built  using 
laser  diode  light  sources. 

We  then  discuss  the  design  and  construction  of  the  optical  neurocomputer  based  on  CGH. 
The  number  of  neurons,  number  of  layers,  and  the  neuron  activation  function  can  all  be 
programmed  without  hardware  adjustments.  We  believe  ours  is  the  first  truly  programmable 
optical  neurocomputer.  In  addition,  the  simple  system  design  requires  only  a  single  crystal,  input 
spatial  light  modulator,  and  output  detector  regardless  of  the  network  configuration.  The  entire 
system  was  built  from  presently  available  off-the-shelf  components.  Although  the  present  research 
system  occupies  a  relatively  large  volume,  we  discuss  packaging  concepts  in  which  the  entire 
system  is  contained  within  a  volume  smaller  than  a  shoe  box. 

Three  widely  differing  neural  network  algorithms  (Perceptron,  bidirectional  associative 
memory,  and  backpropagation)  were  successfully  implemented  on  the  neurocomputer.  As  of  this 
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writing,  up  to  10,000  neurons,  2x10^  weights,  and  learning  rates  of  2x10^  connection  updates  per 
second  have  been  achieved.  The  potential  exists  for  improvement  factors  of  10^  in  the  number  of 
weights  and  10^  in  the  processing  rate.  Finally,  we  discuss  our  experimental  efforts  in  "fixing"  or 
permanently  recording  connection  weights  in  the  holographic  medium. 
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SECTION  2 

NEURAL  NETWORK  MODELS  OF  COMPUTATION 


Computational  problems  can  be  described  in  terms  of  many  attributes.  One  of  the  most 
fundamental  measures  is  the  "randomness"  or  algorithmic  complexity  of  the  problem.  It  is 
generally  agreed  that  nonrandom  highly  structured  problems  with  known  algorithmic  solutions  are 
best  solved  using  traditional  computer  programs.  In  contrast,  many  problems  involving  natural 
data  have  a  high  degree  of  randomness,  especially  pattern  recognition  problems  involving  noisy 
data.  These  are  problems  that  biological  organisms  excel  at  compared  with  classical  rule-based 
algorithms  in  which  the  rules  are  assumed  a  priori.  Biologically  inspired  neural  networic  models 
are  useful  for  solving  such  high  entropy  problems  in  which  the  underlying  algorithm  is  unknown 
and  the  required  transformations  must  be  learned  from  examples.  They  are  more  powerful  than 
standard  statistical  techniques  because  a  larger  range  of  solutions  can  be  represented  using  their 
multi-layer  nonlinear  structure.  In  fact,  it  has  been  shown  that  any  function  can  be  approximated 
by  a  sufficiently  large  neural  network.  ^ 

Neural  network  models  of  computation  consist  of  many  simple  processing  nodes  or 
"neurons"  which  communicate  with  each  other  via  interconnection  weights.  The  nodes  are 
anthropomorphically  called  neurons  here  in  acknowledgment  of  the  vastly  more  sophisticated 
biological  neurons  which  inspire  connectionist  models  of  computing.  (However,  any  resemblance 
of  the  nodes  to  actual  neurons  is,  at  best,  superficial  and  grossly  simplified.)  A  diagram  of  the 
logical  structure  of  a  multi-layer  feed-forward  neural  network  is  shown  in  Fig.  1 .  Associated  with 
each  neuron  is  :ji  activation  level  which  is  calculated  from  a  weighted  sum  of  the  activity  levels  of 
other  neurons. 


Patterns  which  are  input  to  the  network  through  the  bottom  layer  are  transformed  into  patterns 
which  represent  the  answer  to  the  problem  the  network  is  trained  to  solve.  Many  neural  net 
architectures  have  been  designed  and  demonstrated  for  various  computing  tasks.  Techniques  such 
as  backpropagation  as  well  as  many  others  have  been  developed  to  "train"  or  adjust  the 
interconnection  weights  to  solve  problems  in  pattern  recognition,  vision,  and  robotic  control. 
Although  the  problem  of  learning  an  arbitrary  transformation  for  a  completely  random  problem  has 
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been  shown  by  Judd^  to  be  NP-complete  (therefore  probably  requiring  exponential  increases  in 
learning  time  as  the  problem  size  increases),  most  if  not  all  problems  in  pattern  recognition  involve 
at  least  partially  structured  data  for  which  the  learning  time  will  be  a  polynomial  function  of  the 
problem  size. 


Output  Pattern 

t 


Input  Pattern 


Fig.  1.  Structure  of  feed-forward  neural  networks. 

Two  important  parameters  which  characterize  a  neural  network  are  the  number  of  neurons, 
N,  and  their  connectivity,  K,  where  K  is  the  number  of  synapses  or  weights  connected  to  a 
neuron.  We  argue  that  both  N  and  K  should  be  as  large  as  possible  in  any  general  purpose 
neurocomputer.  Pattern  recognition  problems  involving  natural  data,  especially  vision  or  target 
recognition,  require  large  numbers  of  input  neurons  to  handle  the  large  raw  input  data  rate.  In 
addition,  large  numbers  of  neurons  in  the  hidden  layer  are  required  in  order  to  solve  non-trivial 
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problems.  As  for  the  connectivity,  Abu-Mostafa^  has  shown  that  it  must  be  large  for  at  least  two 
reasons.  First,  since  the  neurons  essentially  implement  K-input  threshold  functions,  one  K-input 
neuron  with  K  associated  weights  is  equivalent  to  order  two-input  neurons  with  2KP-  associated 
weights.  Thus  far  fewer  weights  are  required  if  the  neurons  have  high  fan-in  and  fan-out. 

Second,  Abu-Mostafa  also  showed  that  in  order  for  a  neural  network  to  leam,  the  connectivity  K 
must  exceed  the  entropy  H  of  the  environment  where  H  is  the  log2  of  the  number  of  input  patterns 

typically  generated  by  the  environment.  H  increases  as  the  randomness  of  the  problem  increases. 
Since  neural  networks  are  most  useful  for  partially  random  problems,  we  argue  that  both  the 
number  of  neurons  and  their  connectivity  should  be  as  large  as  possible  in  a  neuro-computer. 
Increasing  N  and  K  while  maintaining  computational  parallelism  is  a  daunting  task  for  electronic 
architectures  due  to  the  2-D  nature  of  electronic  interconnects.  Most  of  the  area  on  analog  or  digital 
electronic  neuro-chips  is  taken  up  by  the  interconnects  while  implementing  only  modest  numbers 
of  neurons  (N=O(102)). 

Optical  implementations  of  neural  networks  are  attractive  because  of  the  large  storage 
capacity  and,  most  importandy,  the  parallel  access  and  processing  capabilities  of  opdes.  Optical 
architectures  can  exploit  3-D  free  space  interconnects,  allowing  the  input  and  output  planes  to  be 
fully  populated  with  highly  interconnected  neurons  (N=O(105)).  Moreover,  an  entire  weight  layer 
can  be  updated  in  one  time  step,  unlike  electronic  approaches  which  must  use  some  form  of  time 
multiplexing  when  the  number  of  neurons  is  large.  Optical  neural  networks  divide  naturally  into 
two  classes  distinguished  by  the  dimensionality  of  the  weight  storage  medium,  e.g.  weights  can  be 
stored  in  2-D  or  3-D  formats.  Example  2-D  weight  storage  media  include  film,  SLMs,  and  optical 
disks  while  almost  all  of  the  3-D  formats  use  photorefractive  crystals  as  the  storage  medium.  The 
optical  processor  described  here  uses  3-D  weight  storage  based  on  volume  holography.  The 
primary  motivation  for  considering  volume  holograms  as  a  storage  medium  for  neural  networks  is 
the  potential  for  extremely  high  storage  capacity  and  fully  parallel  processing  of  the  weights  during 
both  the  learning  and  reading  phases.'^  Motivation  for  maximizing  these  parameters  can  be  found 
in  the  potential  application  areas  for  neural  networks. 

Figure  2  is  an  adaptation  of  a  figure  which  originally  appeared  in  the  final  repon  of  the  1988 
DARPA  Neural  Network  Study.  It  shows  the  potential  application  areas  mapped  onto  a  2-D  space 
in  which  one  axis  is  the  storage  required  in  terms  of  the  number  of  weighted  connections  and  the 
other  axis  is  the  processing  rate  required  in  connections  per  second.  The  corresponding  estimated 
parameters  of  some  biological  systems  are  included.  A  variety  of  electronic  implementations, 
denoted  by  open  squares,  have  also  been  added  to  Fig.  2.  (These  performance  points  were  taken 
from  an  anicle  by  Alspector.)  The  electronic  implementations  are,  apan  from  the  Sun  and  Cray 
computers,  specialized  analog  and  digital  chips.  Some  implement  learning  and  some  do  not.  For 
those  that  don't,  weight  values  must  be  learned  off-chip  and  subsequently  loaded  into  the  chip. 
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Finally,  diagonal  lines  of  constant  network  update  time  were  added  to  the  figure.  The  update  time 
is  the  time  required  for  information  to  pass  from  the  input  of  the  network  to  the  output.  Such  a  plot 
is  necessarily  a  highly  folded  and  simplified  projection  of  a  high  dimensional  reality  onto  a  low 
dimensional  representation.  For  example,  the  degree  of  local  vs  global  connectivity  is  ignored  as 
well  as  the  algorithmic  complexity  of  the  application.  Nevertheless,  cenain  trends  can  be  deduced. 


STORAGE  (CONNECTIONS) 

Fig.  2.  Relative  performance  of  present  holographic  neural  network  (HONN)  compared  to 
application  requirements  and  other  hardware  implementations. 

It  is  interesting  to  note  that  potential  application  areas  in  robotics,  speech,  and  vision  cluster 
around  the  10  msec  network  update  time  line.  These  applications  require  large  numbers  of  weights 
in  order  to  achieve  the  complexity  required  to  solve  the  problem,  but  the  problem  does  not  need  to 
be  solved  in  less  than  a  few  milliseconds.  The  large  number  of  weights,  however,  requires  very 
high  processing  rates  to  achieve  this  update  time.  Significantly,  the  biological  systems  also  cluster 
around  the  10  msec  update  line,  perhaps  because  they  must  also  solve  problems  in  speech  and 
vision.  With  the  exception  of  the  general  purpose  computers,  the  electronic  implementations  have 
relatively  modest  stOTage  capacity,  although  their  processing  rates  are  high.  (Igncmng  the  fact  that 
many  of  the  chips  do  not  have  on-chip  learning.)  This  is  due  to  the  2-D  nature  of  VLSI  which 


6 


limits  the  number  of  connections  that  are  practical.  They  therefore  appear  most  suited  to  the  signal 
processing  applications  in  which  fast  update  times  and  modest  storage  are  required. 

It  is  our  opinion  that  optical  neurocomputers  are  complementary  to  specialized  electronics 
in  that  the  3-D  connectivity  and  parallelism  of  optics  permits  the  implementation  of  very  large 
networks  with  high  processing  rates  and  relatively  modest  network  update  times  suitable  to 
applications  in  vision.  The  projected  future  performance  of  our  optical  neurocomputer  is  indicated 
by  the  large  filled  circle  labeled  ONN.  The  present  performance  of  our  optical  neurocomputer  is 
also  indicated  by  the  filled  square.  Despite  being  a  proof-of-principle  first  prototype  system  using 
non-optimized  components,  its  performance  is  quite  good. 

In  order  to  fulfill  a  practical  role  complementary  to  the  strengths  of  electronics,  optical 
neurocomputers  should  include,  first,  large  numbers  of  neurons  and  weights;  second,  distortionless 
software  mapping  of  a  variety  of  neural  network  algorithms  onto  the  hardware  with  no  hardware 
reconfiguration;  third,  co-processor-type  interfacing  to  a  host  computer;  and  fourth,  hardware 
simplicity  for  low  cost  and  compact  packaging.  In  this  final  report  we  describe  an  experimental 
optical  neurocomputer  which  serves  as  a  testbed  for  meeting  these  requirements.  The 
neurocomputer  is  a  nonlinear,  highly  interconnected,  parallel,  and  analog  opto-electronic  computer 
based  on  real  time  holography.  In  the  next  few  sections  we  discuss  some  aspects  of  holography 
relevant  to  neural  network  implementation. 
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SECTION  3 


REAL-TIME  HOLOGRAPHY  FOR  NEURAL 

NETWORKS 

In  holographic  optical  neural  networks,  neurons  are  represented  by  pixels  on  SLMs.  The 
brightness  of  a  pixel  corresponds  to  the  activation  level  of  the  neuron.  By  placing  the  SLM  in  the 
back  focal  plane  of  a  lens  and  using  coherent  readout,  as  shown  in  Fig.  3,  the  pixels  are  converted 
to  coherent  beams  which  illuminate  a  real-time  holographic  medium.  Weights  between  neurons  are 
formed  when  a  pair  of  light  beams  interfere  in  the  holographic  medium,  forming  a  volume 
sinusoidal  light  intensity  pattern.  The  photorefiractive  effect  is  a  suitable  physical  mechanism  for 
converting  this  light  intensity  pattern  into  a  semipermanent  deformation  of  the  optical  properties  of 
the  material,  thereby  recording  the  weight  values. 


Fig.  3.  Geometry  for  holographically  recording  connection  weights  between  neurons. 


In  the  photorefiractive  effect  incident  light  excites  carriers  (electrons  and/or  holes)  from  traps 
into  the  conduction  or  valence  band.  These  carriers  then  are  transported  by  diffusion  and  drift  until 
they  fall  into  empty  traps,  creating  an  internal  space-charge  field  which  in  turn  nKxlulates  the 
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birefringence  of  the  material  through  the  electro-opQc  effect.  This  results  in  a  phase  grating  in  the 
material.  Because  of  the  long  dark  decay  times  of  some  of  these  materials,  the  phase  gratings  can 
be  stored  with  a  time  constant  of  many  hours.  (Storage  for  longer  periods  is  also  possible  in  some 
materials  usmg  various  fixing  methods.)  When  one  of  the  original  two  beams  subsequently 
addresses  the  grating,  the  other  beam  is  reconstructed  with  a  diffraction  efficiency  that  represents 
the  weight  value  between  those  two  neurons. 

The  diffraction  efficiency  of  the  semi-permanent  phase  grating  represents  the  connection 
weight  formed  between  the  neurons.  It  is  proportional  to  the  outer-product  of  the  amplitudes  of  the 
writing  beams  and  has  a  fonuitous  similarity  to  the  Hebbian  learning  rules  of  many  neural  network 
models.  This  equivalence  between  the  ou»er-product  foim  of  the  diffraction  efficiency  and  Hebbian 
learning  forms  the  basis  for  implementing  the  weights  directly  using  the  analog  laws  of  physics 
rather  than  digital  representations  as  in  conventional  computers.  Learning  can  be  implemented  in 
photorefractive  optical  neural  networks  since  the  weights  can  be  selectively  increased  or  decreased. 
Reading  out  the  grating  partially  erases  it  unless  the  readout  beam  is  much  weaker  than  the  original 
writing  light  or  the  crystal  is  fixed  using  special  techniques. 

The  angular  or  Bragg  selectivity  of  a  volume  photorefractive  grating  can  be  very  high  which 
results  in  large  storage  capacities.  (For  example,  500  high  quality  images  consisting  of  70,000 
pixels  each  have  been  stored  in  a  single  photorefractive  crystal.^)  The  Bragg  condition  states  that  a 
beam  will  be  reconstructed  only  if  the  angle  of  incidence  of  the  incident  beam  relative  to  the  grating 
is  equal  to  that  of  the  original  writing  beam.  The  angular  selectivity  for  reconstruction  can  be 
calculated  from  coupled  mode  theory^  and  is  given  by 


nT^  sin  (p 


where  Tz  is  the  grating  thickness,  X  is  the  optical  wavelength,  n  is  the  index  of  refraction,  and  ()>  is 
the  angle  between  the  two  writing  beams.  Note  that  the  selectivity  is  greater  for  thicker  crystals. 
Each  individual  light  beam  can  be  represented  by  a  momentum-  or  k-vector.  (The  direction  of  the  k- 
vector  corresponds  to  the  direction  of  propagation  and  the  magnitude  of  the  k-vcctor  is  the  inverse 
of  the  wavelength.)  By  using  phase  matching  arguments,  the  Bragg  condition  can  be  described 
geometrically  as  a  vector  sum:  Kj  +  Kg  =  Kj,  where  Kj  and  Ki  are  the  wavevectors  of  the 
incident  and  diffracted  beams,  respectively,  and  Kg  is  the  grating  wavevector. 

A  geometrical  construction  for  the  theoretical  storage  capacity  of  a  volume  hologram  can  be 
drawn  in  k-space,  as  shown  in  Fig.  4.  If  one  writing  beam  varies  over  solid  angle  6o  while  the 
second  writing  beam  varies  over  angle  6^,  then  the  vector  difference  between  the  two  beams  (the 
grating  wavevector  Kg)  will  trace  out  a  three-dimensional  region  in  k-space  (shaded  gray  in  Fig.  4). 
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This  volume  represents  the  region  of  k-space  that  is  accessible  for  storage  of  information.  It  will  be 
further  limited  by  the  resolution  or  modulation  transfer  function  (MTF)  of  the  holographic  medium. 
This  limit  is  represented  by  a  sphere  centered  on  the  origin  whose  radius  is  equal  to  the  largest 
spatial  frequency  that  can  be  resolved  by  the  medium.  The  volume  of  the  accessible  region  depends 
on  such  geometrical  factors  as  the  focal  lengths  of  the  optics  and  the  spacing  of  the  neurons  on  the 
SLMs. 


Volume  of  possible 
grating  k-vectors; 


Accessable 
Not  Accessable 


Fig.  4.  K-space  diagram  of  grating  k-vectors  accessible  using  recording  geometry  of  Fig.  2. 

The  grating  wavevector  Kg  has  an  uncertainty  volume  associated  with  it  due  to  the  finite 
physical  size  of  the  hologram  and  the  nonzero  size  of  the  SLM  pixels.  Dividing  the  accessible 
volume  of  k-space  by  the  volume  of  the  uncertainty  volume  results  in  the  maximum  theoretical 
number  of  resolvable  gratings  which  can  be  stored  in  the  photorefractive  crystal.  Without  going  into 
details  here  it  can  he  shown  that  the  storage  capacity  is  limited  by  two  upper  bounds  due  to  the 
hologram  and  neuron  dimensions: 
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where  V^ojo  is  the  hologram  volume,  f  is  the  lePi  focal  length,  dneuron  the  SLM  pixel  diameter, 
tOQ  is  the  object  solid  angle,  and  a2*^  are  me  angular  limits  of  excursion  of  the  reference 
beam,  6  is  the  mean  object-reference  beam  angle,  and  n}  is  the  crystal  index  of  refraction.  It  was 
assumed  in  the  calculations  that  led  to  the  above  result  that  the  hologram  MTF  limit  is  high  enough 
to  be  ignored.  For  an  active  crystal  volume  of  a  few  cubic  mm  and  reasonable  optical  parameters 
the  values  of  the  two  upper  bounds  range  from  iQl®  to  10*2  gratings.  This  is  sufficient  to  form  a 
fully  interconnected  network  of  10^  to  10^  neurons.  Moreover,  each  grating  can  be  read  out  or 
updated  in  parallel  without  the  time  multiplexing,  data  contention,  or  bottleneck  problems  common 
in  electronic  architectures. 

The  high  theoretical  storage  capacity  and  parallel  access  to  weights  is  a  direct  result  of  the 
three-dimensional  nature  of  optical  holographic  storage.  However,  other  nonfundamental  factors 
also  limit  the  practical  performance.  The  challenge  is  to  construct  optical  neurocomputers  that 
reduce  the  difference  between  actual  performance  and  theoretical  limits  by  attacking  these  factors. 
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SECTION  4 

LIMITATIONS  OF  CONVENTIONAL  HOLOGRAPHY 


A  variety  of  mechanisms  limit  the  practical  holographic  storage  capacity  of  photorefractive 
crystals  to  values  below  the  theoretical  limit.  They  include  hologram  dynamic  range,  Bragg 
degeneracy,  beam  coupling,  and  laser,  hologram,  and  detector  noise.  In  this  section  we  discuss 
limitations  due  to  Bragg  degeneracy  and  beam  coupling,  factors  which  can  be  alleviated  using 
cascaded-grating  holography. 

4.1  BRAGG  DEGENERACY 

One  of  the  most  important  impediments  to  holographic  neural  networks  is  crosstalk  which 
arises  from  an  effect  known  as  "Bragg  degeneracy."  The  Bragg  condition  states  that  the  angle  of 
incidence  of  a  light  beam  relative  to  a  volume  grating  must  match  one  of  the  original  writing  beams 
in  order  to  form  an  optical  connection.  However,  even  if  the  angular  selectivity  is  high,  crosstalk 
can  still  occur.  Given  a  particular  grating,  it  is  possible  for  many  light  beams  to  satisfy  the  Bragg 
condition  for  that  grating,  in  addition  to  the  beams  which  originally  wrote  the  grating.  As  shown  in 
the  momentum  space  diagram  of  Fig.  5,  a  set  of  beam  pairs  which  define  the  surfaces  of  two  end- 
to-end  cones  all  form  the  same  angle  with  respect  to  the  grating.  All  of  the  neuron  pairs  defined  by 
the  cones  are  connected  by  that  grating  even  though  it  was  written  by  only  one  grating  pair. 
Therefore,  a  large  set  of  beams  other  than  the  original  writing  beam  can  scatter  constructively  from 
the  grating,  forming  erroneous  reconstructions  and  crosstalk. 
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An  experimental  demonstration  of  Bragg  degeneracy  is  shown  in  Fig.  6.  We  recorded  a 
hologram  in  a  c-cut  BaTi03  crystal  using  a  laser  diode  light  source  with  a  wavelength  of  830  nm. 

(The  wavelength  and  crystal  geometry  were  chosen  for  low  two-wave  mixing  gain  in  order  to 
eliminate  beam  fanning  which,  as  will  be  shown  later  in  this  report,  can  be  used  to  eliminate  Bragg 
degeneracy.)  The  experimental  configuration  of  Fig.  3  was  used.  During  recording  the  object 
plane  consisted  of  a  regular  rectangular  grid  pattern  and  the  reference  plane  was  a  uniformly  filled 
rectangle  (all  pixels  on).  When  the  hologram  was  read  out  using  the  uniform  reference,  the 
reconsmicted  object  was  smeared  in  the  vertical  direction.  No  horizontal  smearing  can  be  observed. 
This  can  be  easily  explained  by  considering  the  k-space  diagram  of  Fig.  7.  In  Fig.  7(a).  it  can  be 
seen  that  the  grating  k-vector  created  by  beams  R1  and  01  also  connects  all  beams  above  and  below 
R1  in  the  R  plane.  These  beams,  such  as,  for  example,  R2,  reconstruct  extraneous  beams  such  as 
02  which  appear  above  and  below  the  original  beam  01.  This  results  in  vertical  smearing.  In 
Fig.  7(b)  we  see  that  horizontally  displaced  beams  are  not  connected  by  the  same  grating  k-vector, 
therefore  horizontal  smearing  is  much  less.  Note  that  Bragg  degeneracy  cannot  be  eliminated  by 
simply  phase  aberrating  the  reference  beam  since  the  above  construction  would  still  hold  for  the 
individual  plane  wave  components. 

Possible  approaches  for  avoiding  Bragg  degeneracy  are  subsampling  of  the  SLMs”^  and  spatial 
multiplexing  of  holograms.^  In  the  subsampling  approach,  neurons  are  arranged  in  sparse 
nonredundant  patterns  on  the  SLMs,  and  output  planes  are  similarly  sparsely  sampled;  thus 
although  false  reconstructions  still  occur,  they  occur  at  unused  positions  and  do  not  contribute  to  the 
output.  The  special  patterns  can  consist  of  so-called  "fractal"  lattices  or  other  sparse  patterns.  If  the 
SLMs  are  capable  of  displaying  NxN  neurons,  then  this  approach  can  implement  a  total  of  N3/2 
neurons  and  N^  weights.  This  has  the  pleasing  quality  that  the  storage  capacity  of  the  crystal  and 
the  number  of  weights  required  to  fully  interconnect  the  neurons  on  the  sampled  SLMs  have  the 
same  dimensional  scaling.  In  many  practical  cases,  however,  it  also  has  the  drawback  that  the 
storage  capacity  may  be  limited  by  the  number  of  neurons  that  can  be  displayed  in  sparse  patterns 
on  the  SLM,  rather  than  by  the  potentially  large  capacity  of  the  crystal.  As  discussed  above,  the 
storage  capacity  of  a  1  cm^  crystal  should  be  sufficient  to  store  the  interconnections  for  a  NxN 
array  of  neurons  where  N=5{)0,  which  matches  the  capabilities  of  present-day  SLMs.  However, 
because  of  the  subsampling,  only  N3/2  neurons  can  be  implemented  even  though  the  SLM  is 
capable  of  displaying  N^  neurons.  Since  N=5()0,  the  neuron  and  weight  storage  capacities  are 
reduced  by  factors  of  22  and  500,  respectively,  from  the  theoretical  maximums.  The  light 
efficiency  is  also  lowered  because  some  of  the  light  is  diffracted  to  unused  pixels  as  a  result  of  the 
Bragg  degeneracy. 
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Fig.  6.  Experimental  demonstration  of  Bragg  degeneracy  using  infrared  hologram  recorded  in 
c-cut  BaTi03.  Vertical  smearing  is  a  sign  of  Bragg  degeneracy. 


14 


The  spatial  multiplexing  approach  avoids  the  Bragg  degeneracy  problem  by  physically 
dividing  the  crystal  into  separate  volumes  for  each  weight.  However,  this  reduces  the  coupling 
efficiency  of  gratings,  reduces  parallelism  because  sequential  exposures  must  be  used,  and 
increases  hardware  complexity. 


(a) 

VERTICAL  CROSSTALK 
(FRONT  VIEW) 
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HORIZONTAL  CROSSTALK 
(TOP  VIEW) 


Fig.  7.  K'Space  diagram  showing  origin  of  vertical  smearing  efTect  in  Bragg  degeneracy. 

Vertically-displaced  pairs  of  reference-object  points  are  all  connected  by  the  same 
grating  k-vector. 


4.2  ANGULAR  WIDTH  BRAGG  DEGENERACY 

In  this  section  we  discuss  a  previously  unidentified  effect  related  to  the  Bragg  degeneracy 
that  can  occur  in  holographic  interconnection  systems.  This  effect  results  in  further  unintended 
Bragg  degeneracies  due  to  the  angular  width  of  the  Bragg  diffraction  responses  of  volume 
holographic  gratings.  We  hereafter  refer  to  this  effect  as  angular  width  Bragg  degeneracy.  We 
define  it  more  precisely  in  Section  1.3.  The  main  practical  result  of  our  investigations  is  that  one  of 
two  design  rules  for  developing  fractal  sampling  grids  (that  are  used  to  avoid  the  effects  of  Bragg 
degeneracy)  does  not  always  work;  i.e.,  while  this  rule  can  be  used  to  avoid  direct  Bragg 
degeneracy  crosstalk,  it  does  not  necessarily  eliminate  angular  width  Bragg  degeneracy  and  its 
associated  crosstalk.  For  holographic  interconnection  systems  that  utilize  fractal  sampling  grids  to 
avoid  Bragg  degeneracy  it  is  thus  important  to  be  aware  of  the  conditions  under  which  angular 
width  Bragg  degeneracy  occurs  and  to  use  the  correct  fractal  sampling  grid  design  rule  to  avoid  its 
presence  in  the  desired  interconnections. 


15 


in  Section  4.2. 1  we  briefly  review  Bragg  degeneracy  and  its  effects  on  interconnection 
systems.  We  discuss  the  two  design  rules  that  have  been  proposed  to  develop  fractal  sampling 
grids.  In  Section  4.2.2  we  describe  angular  width  Bragg  degeneracy  in  more  detail  and  illustrate 
its  effects  using  numerical  modeling  examples  based  on  a  three  dimensional,  multiple  grating 
version^  of  the  optical  beam  propagation  method  (BPM).^®  Our  findings  are  summarized  in 
Section  4.2.3,  and  the  relationship  of  angular  width  Bragg  degeneracy  to  the  fundamental 
interconnection  capacity  of  holographic  media  is  briefly  addressed. 

4.2.1  Bragg  Degeneracy  and  Fractal  Sampling  Grids 

As  discussed  in  Section  4.1,  Bragg  degeneracy  occurs  when  a  grating  that  connects  a 
desired  pair  of  pixels  also  inadvenently  connects  one  or  more  additional  pairs  of  pixels  due  to  the 
two  cones  of  angles  that  are  Bragg  matched  to  that  particular  grating.  1  *  One  method  of  visualizing 
which  pixels  in  the  interconnection  system's  input  and  output  planes  have  degenerate 
interconnections  is  to  project  the  cones  of  Bragg-matched  angles  onto  the  pixel  planes. as  is 
illustrated  in  Fig.  8,  for  a  given  grating  this  projection  results  in  two  lines,  one  on  each  pixel  plane. 
The  two  degeneracy  lines  shown  in  the  figure  correspond  to  the  grating  connecting  the  top-most 
pixel  pair  (joined  by  a  solid  line).  Due  to  Bragg  degeneracy,  this  grating  also  connects  the  pair  of 
pixels  joined  by  the  dashed  line  since  these  pixels  are  on  the  degeneracy  lines. 

Output  Input 


Fig.  8.  Bragg  degeneracy  causes  a  desired  interconnection  (such  as  between  the  pixels  joined  by 
the  solid  One)  to  inadvertently  connect  other  pixel  pairs  (such  as  the  pixels  joined  by  the 
dashed  line)  that  are  on  the  degeneracy  lines.  Such  pixel  pairs,  together  with  the  original 
pbtel  pair,  form  the  vertices  of  a  rectangle. 
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As  has  been  generally  recognized,  Bragg  degeneracy  can  be  a  significant  source  of 
crosstalk  in  holographic  interconnection  systems  that  superimpose  all  of  the  desired  interconnection 
gratings  within  the  same  volume  of  material.  Bragg  degeneracy  can  seriously  affect  the  fidelity  of 
holographic  interconnections  in  at  least  two  ways:  1)  the  recording  of  a  desired  grating 
inadvenently  connects  undesired  pairs  of  pixels,  and  2)  if  several  pairs  of  pixels  are  connected  by 
gratings  that  are  degenerate  then  the  actual  weighted  interconnecdon  between  each  of  the  pairs  is 
the  sum  of  the  complex  (magnitude  and  phase)  grating  amplitudes.  In  the  first  case,  completely 
undesired  interconnections  are  formed,  and  in  the  second  case  desired  interconnections  have 
incorrect  weights.  For  holographic  interconnection  systems  in  which  Bragg  degeneracy  can  occur, 
it  is  thus  important  to  eliminate  its  effects.  The  use  of  so-called  fractal  sampling  grids  for  the  input 
and  output  planes  has  often  been  suggested  for  this  purpose. 

The  use  of  fractal  sampling  grids  involves  creating  patterns  of  pixels  on  the  input  and 
output  planes  that  can  be  interconnected  only  by  non-degenerate  gratings.  The  direct  crosstalk  due 
to  Bragg  degeneracy  is  thus  avoided,  but  at  the  cost  of  reduced  spatial  sampling  of  the  input  and 
output  plane  pixelation.  For  example,  if  an  interconnection  system  is  implemented  using  two- 
dimensional  pixel  planes  that  are  each  composed  of  pixels  arranged  as  an  evenly  spaced  M  by 
M  array,  the  largest  number  of  usable  pixels  in  each  plane  is  M3/2  for  an  N-to-N  interconnection. 

Several  design  rules  have  been  proposed  *3  to  generate  fractal  sampling  grids.  These  can  be 
summarized  as  the  "no  rectangle  rule"  and  the  “unequal  row  spacing  rule.”  Each  rule  is  claimed  to 
be  a  sufficient  condition  for  avoiding  undesired  crosstalk  due  to  Bragg  degeneracy. 

The  rectangle  rule  can  be  understood  as  follows.  Note  in  Fig.  8  that  when  two  pairs  of 
pixels  (each  pixel  pair  consisting  of  one  pixel  in  the  input  plane  and  one  pixel  in  the  output  plane) 
are  connected  by  degenerate  gratings,  the  pixel  pairs  form  the  vertices  of  a  rectangle.  Use  of  the  no 
rectangle  rule  consists  of  generating  patterns  of  pixels  for  the  input  and  output  planes  such  that  no 
rectangles  can  be  formed  having  two  pixels  in  the  input  plane  and  two  pixels  in  the  output  plane  as 
venices.  An  example  of  a  set  of  input  and  output  plane  grid  patterns  that  satisfies  the  no  rectangle 
rule  is  shown  in  Fig.  9(a)  for  a  27 -to- 27  interconnection  system  utilizing  by  9  pixel  arrays.  The 
large  and  small  dots  refer  to  used  and  unused  pixel  sites,  respectively. 

An  example  set  of  grid  patterns  generated  using  the  unequal  row  spacing  rule  is  shown  in 
Fig.  9(b).  The  operating  principle  here  is  to  select  rows  of  pixels  that  are  venically  separated  by 
unequal  numbers  of  pixels  in  the  input  and  output  planes.  In  the  example  shown  in  Fig.  9(b),  the 
rows  in  the  input  plane  are  separated  by  either  4  or  8  pixels,  while  the  row  separation  in  the  output 
plane  is  3  or  6  pixels.  As  we  shall  see  below,  fractal  sampling  grids  generated  with  the  unequal 
row  spacing  rule  automatically  avoid  angular  width  Bragg  degeneracy  as  well  as  normal  Bragg 
degeneracy.  This,  however,  is  not  true  for  grids  generated  with  the  no  rectangle  rule. 
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Two  possible  fractal  sampling  grids  for  a  27-27  interconnection  system,  (a)  A  fractal 
sampling  grid  that  satisfies  the  no  rectangle  rule,  but  not  the  unequal  row  separation 
rule,  (b)  A  fractal  sampling  grid  designed  using  the  unequal  row  separation  rule. 


4.2.2  Deflnition  of  Angular  Width  Bragg  Degeneracy 

In  this  section  we  explain  more  clearly  the  nature  of  angular  width  Bragg  degeneracy,  how  it 
arises,  its  effect  on  weighted  interconnecdons,  and  how  it  may  be  avoided.  In  the  literature,  Bragg 
degeneracy  is  described  only  in  terms  of  exact  angular  Bragg-alignment  of  undesired  pixel  pairs  to 
a  degenerate  grating.  Such  a  description  neglects  an  important  point:  the  angular  sensitivity  of  a 
volume  grating's  diffraction  response  has  a  finite  angular  width  about  its  Bragg  angles.  The 
angular  FWHM  of  this  diffraction  response,  A6,  is  approximately  AID  (measured  in  the 
holographic  medium)  in  which  A  is  the  period  of  the  grating  and  D  is  the  optical  path  length  of  the 
readout  beam  through  the  medium.  The  possibility  therefore  exists  for  an  interconnection  grating 
to  inadvertently  connect  an  unintended  pair  of  pixels  if  each  pixel  is  within  an  angular  range  of 
approximately  ABol  the  grating's  Bragg  angles.  This  is  what  we  mean  by  angular  width  Bragg 
degeneracy. 

We  illustrate  this  effect  and  compare  it  to  normal  (Bragg-matched)  Bragg  degeneracy  using 
the  following  pair  of  examples.  Consider  two  columns  of  pixels  as  shown  in  Figs.  10(a)  and 
10(b).  Let  the  right  column  in  each  figure  be  in  the  input  plane  of  a  holographic  interconnection 
system  and  the  left  column  in  the  output  plane.  In  each  case,  a  single  sinusoidal  phase  grating  is 
present.  In  Fig.  10a,  the  grating  is  intended  to  connect  the  pixels  labeled  (49,0)  and  (-49,0)  in  the 
input  and  output  planes,  respectively.  In  Fig.  10(b),  the  grating  connects  pixels  (49,1)  and 
(-49,0).  By  turning  on  the  63  pixels  in  the  right  hand  column  and  monitoring  the  diffracted  light  at 
each  of  the  63  pixels  in  the  left  hand  column,  we  can  evaluate  the  amount  of  crosstalk  in  each  case 
caused  by  the  presence  of  a  single  grating. 

In  the  case  of  Fig.  10a,  we  naturally  expect  each  pixel  in  the  input  plane  to  be  connected  to  its 
opposite  pixel  in  the  output  plane  since  the  interconnection  grating  is  degenerate  to  all  of  these 
interconnections.  This  is  illustrated  in  Fig.  1 1(a)  in  which  the  expected  diffracted  outputs  of  the 
output  plane  pixels  are  shown  as  a  function  of  the  pixel  index  [each  diffracted  output  is  assumed  to 
have  I  %  diffraction  efficiency  for  consistency  with  the  numerical  simulation  results  discussed 
below  and  shown  in  Fig.  1 1(b)].  As  shown  in  Fig.  1 1(c),  we  expect  the  grating  of  Fig.  10(b)  to 
yield  a  diffracted  output  only  at  pixel  (49,0)  in  the  output  plane  when  all  63  input  plane  pixels  are 
turned  on.  Since  no  other  pairs  of  pixels  lie  on  the  lines  of  degeneracy  for  this  interconnection 
grating,  no  other  diffracted  outputs  should  be  present. 

We  tested  the  above  predictions  by  using  a  three  dimensional  version  of  BPM  to  propagate 
the  63  readout  beams  through  a  holographic  medium  in  which  the  desired  interconnection  grating 
was  recorded.  We  assumed  an  angular  separation  of  18  degrees  between  the  pixel  columns,  a 
readout  wavelength  of  514  nm,  and  a  5.0  mm  thick  holographic  medium  having  a  refractive  index 
of  2.48.  The  pixel  separation  in  the  input  and  output  planes  was  taken  to  be  0.97  mm.  The  focal 
length  of  the  collimating  lens  between  each  pixel  plane  and  the  holographic  medium  was  assumed 
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a)  Grating  interconnection  between  pixeis  (49, 0)  and  (-49,0) 
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Fig.  10.  (a)  Gratiag  laterconnectlon  between  pixels  (49,0)  and  (-49,0).  (b)  Grating 

interconnection  between  pixels  (49,1)  and  (-49,0). 
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lo  be  300  mm.  The  grating  strength  of  the  interconnection  grating  was  designed  to  yield  1  % 
diffraction  efficiency  for  a  single  input  to  a  single  output. 

Simulation  results  are  shown  in  Figs.  1 1(b)  and  11(d).  As  seen  in  Fig.  11(b),  Bragg 
degeneracy  (the  case  of  Fig.  10a)  docs  indeed  cause  each  input  plane  pixel  to  be  connected  to  its 
opposite  output  plane  pixel,  and  the  simulation  matches  our  expected  results  [Fig.  1 1(a)]! 

However,  as  seen  by  comparing  Figs.  1 1(c)  and  1 1(d),  there  are  more  diffracted  outputs  in  the 
simulation  results  than  expected  for  the  interconnection  grating  shown  in  Fig.  10b.  This  is  due  to 
angular  width  Bragg  degeneracy,  which  causes  pixel  (49,i)  in  the  input  plane  to  be  connected  to 
pixel  (-49,i-l)  in  the  output  plane.  For  example,  even  though  pixels  (49,0)  and  (-49,-1)  are  not 
exactly  on  the  degeneracy  lines  of  the  interconnection  grating,  they  are  close  enough  to  be  well 
within  AQ  of  the  corresponding  Bragg  angles  of  the  grating  and  therefore  are  connected  by  the 
grating.  The  efficiency  of  the  interconnection  depends  on  how  angularly  detuned  a  given  pixel  pair 
is  from  the  grating.  This  is  seen  in  Fig.  1 1  (d)  in  which  the  light  detected  at  the  output  pixels  is 
shaped  like  the  angular  sensitivity  of  the  interconnection  grating. 

One  can  summarize  angular  width  Bragg  degeneracy  by  noting  that  the  degeneracy  lines 
shown  in  Fig.  8  are  not  infinitesimally  thin.  Rather  (as  seen  in  Fig.  12),  they  have  a  finite 
thickness  that  is  dependent  on  the  angular  width  of  the  interconnection  grating  to  which  they 
correspond.  This  can  result  in  undesired  interconnections  for  fractal  sampling  grids  designed  with 
the  no  rectangle  rule.  In  the  next  section  we  illustrate  the  effects  of  angular  width  Bragg 
degeneracy  on  the  reconstruction  fidelity  of  weighted  interconnections. 

4.2.3  Effects  on  Weighted  Interconnections 

Angular  width  Bragg  degeneracy  can  have  just  as  severe  an  effect  on  the  reconstruction 
fidelity  of  weighted  interconnections  as  normal  (Bragg-matched)  Bragg  degeneracy.  In  this  section 
we  discuss  a  set  of  numerical  simulations  of  9-to-9  weighted  interconnections  performed  using 
BPM.  By  comparing  the  RMS  error  of  the  normalized  diffracted  outputs,  we  quantitatively 
examine  the  fidelity  performance  of  the  9-to-9  holographic  interconnection  system  in  the  presence 
(among  the  desired  interconnections)  of  normal  Bragg  degeneracy  (C^se  A),  angular  width  Bragg 
degeneracy  (Case  B),  and  no  form  of  Bragg  degeneracy  (Case  C). 

In  our  BPM  simulations,  the  same  assumptions  apply  as  for  the  simulations  in  Section  1.3.1 
except  for  the  following  additions.  First,  the  holographic  medium  is  assumed  to  have  a  linear 
recording  characteristic  relative  to  the  incident  write  beam  intensity.  Second,  only  the  desired 
interconnection  gratings  are  assumed  to  be  present  in  the  medium;  no  cross  gratings  exist  between 
pixels  within  the  input  or  output  planes  (as  would  be  created  using  a  simultaneous  recording 
method,  for  example). 
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Fig.  12.  Degeneracy  lines  have  finite  thickness,  corresponding  to  the  Bragg  width  of 
a  given  interconnection  grating.  This  results  in  additional  degenerate 
interconnections  (compare  dashed  interconnection  lines  above  with  Fig.  1). 

The  pixel  geometry  for  Case  A  is  shown  in  Fig.  13(a).  The  pixels  in  both  the  input  and 
output  planes  are  each  arranged  on  a  regular  3  by  3  grid,  and  thus  significant  Bragg  degeneracy  is 
present  in  the  interconnection  system.  The  weights  are  the  same  between  all  of  the  input  plane 
pixels  and  a  given  pixel  in  the  output  plane.  The  relative  weights  for  the  top  row  of  pixels  in  the 
output  plane  (left  to  right)  are  0.8, 0.5,  and  0.2.  Similarly,  the  relative  weights  for  the  second  and 
third  rows  are  0.9,  0.6,  0.3,  and  1.0,  0.7,  0.4,  respectively.  All  nine  input  plane  pixels  are  used 
during  readout,  each  generating  a  unit  intensity  plane  wave  at  the  front  face  of  the  holographic 
medium.  In  the  absence  of  fidelity  errors,  the  diffracted  outputs  at  each  pixel  should  therefore  have 
the  same  relative  detected  power  as  the  weight  corresponding  to  that  pixel.  The  ideal  diffracted 
outputs  for  Case  A  are  shown  in  Fig.  14(a). 

Simulation  results  for  Case  A  are  shown  in  Fig.  14(b).  Since  presumably  only  the  diffracted 
outputs  at  the  nine  original  output  plane  pixels  are  of  interest,  these  are  shown  in  Fig.  14(c).  Fig. 
14c  therefore  represents  the  actual  detected  outputs  (assuming  that  detectors  are  placed  only  at  the 
desired  output  pixels),  while  Fig.  14(b)  represents  all  of  the  diffracted  outputs  generated  by  the 
interconnection  system.  Those  outputs  that  are  present  in  Fig.  14(b)  but  not  in  14(c)  (for  example. 
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Fig.  14.  Simulation  results  Tor  Case  A  (see  text). 


at  pixels  (-49,  2),  (-50,3),  and  (-51,2))  are  due  to  Bragg  degeneracy.  They  naturally  do  not 
directly  affect  the  detected  outputs. 

As  seen  by  comparing  in  Figs.  14(a)  and  14(c),  there  are  significant  visible  differences 
between  the  ideal  and  actual  outputs  of  the  interconnection  system.  The  relative  magnitudes  of  the 
detected  outputs  are  obviously  incorrect.  One  quandtative  measure  of  the  amount  of  error  in  the 
detected  outputs  is  the  RMS  error  of  the  normalized  outputs.  If  the  ideal  outputs  are  represented  by 
the  vector  >  (each  component  of  which  is  the  ideal  magnitude  of  one  of  the  nine  pixels  in  the  output 
plane)  and  the  actual  diffracted  outputs  by  the  vector  y'  (each  component  being  the  detected  output 
at  the  pixel  corresponding  to  the  same  component  in  y),  the  RMS  error  of  the  normalized  outputs, 
e,  is 
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in  which  N  =  9.  The  maximum  value  of  this  error  is  ^^2.  We  have  found  that  values  of  0.1 
correspond  to  large  errors  in  the  relative  magnitude  of  individual  outputs  (up  to  25%  or  more). 

The  RMS  error  for  Case  A  is  0.173,  which  (as  seen  in  Fig.  14(c)]  represents  a  significant 
deviation  from  the  desired  outputs.  As  will  be  seen  upon  examining  Case  C  (in  which  there  is  no 
Bragg  degeneracy),  most  of  the  error  in  Case  A  is  due  to  Bragg  degeneracy. 

The  pixel  geometry  for  Case  B  is  shown  in  Fig.  13(b).  Note  that  it  is  a  subset  of  the  fractal 
sampling  grid  shown  in  Fig.  9(a)  that  was  generated  using  the  no  rectangle  rule.  The  same  relative 
weights  and  readout  intensities  as  for  Case  A  are  assumed  for  the  BPM  simulation  of  Case  B.  The 
ideal  outputs  are  shown  in  Fig.  15(a),  the  diffracted  outputs  in  Fig.  15(b),  and  the  detected  outputs 
in  Fig.  15(c).  At  first  glance,  the  detected  outputs  appear  to  be  similar  to  the  ideal  outputs. 
However,  closer  examination  reveals  that  the  pixels  having  a  y-axis  index  of  4  are  nearly  a  factor 
of  two  too  small  compared  to  the  other  pixels.  Indeed,  the  RMS  error  for  this  case  is 
0. 1 82-slightly  larger  than  Case  A's! 

This  poor  fidelity  performance  is  due  to  angular  width  Bragg  degeneracy.  As  an  example, 
consider  the  two  interconnections  indicated  by  the  dashed  lines  in  Fig.  13(b).  Although  the  four 
pixels  involved  do  not  form  the  vertices  of  a  rectangle,  a  simple  calculation  shows  that  the  upper 
set  of  pixels  is  well  within  40  of  the  grating  connecting  the  lower  set  of  pixels,  and  vice  versa. 
Thus,  angular  width  Bragg  degeneracy  is  present  among  these  interconnections.  Similarly,  the 
interconnections  between  each  pixel  in  the  middle  row  of  the  input  plane  to  each  pixel  of  the  middle 
row  of  the  output  plane  arc  degenerate  with  the  corresponding  interconnections  between  pixels  in 
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the  bottom  rows.  This  angular  width  Bragg  degeneracy  is  the  reason  that  the  detected  outputs  of 
the  pixels  in  Fig.  14(c)  along  rows  1  and  -3  are  nearly  twice  as  large  as  those  along  row  4. 

Note  that  angular  width  Bragg  degeneracy  occurs  in  a  pixel  geometry  that  satisfies  the  no 
rectangle  rule.  This  result  indicates  that  the  no  rectangle  rule  is  not  a  sufficient  condition  for  the 
complete  avoidance  of  any  form  of  Bragg  degeneracy.  This  is  in  direct  contradiction  to  References 
12  and  13. 

The  pixel  geometry  for  Case  C  is  shown  in  Fig.  13(c).  It  is  a  subset  of  the  fractal  sampling 
grid  generated  using  the  unequal  row  separation  rule  shown  in  Fig.  9(b).  Again,  the  relative 
weights  and  readout  beam  intensities  are  the  same  as  above.  The  ideal  outputs  are  shown  in 
Fig.  16(a).  The  diffracted  actual  outputs  from  the  BPM  simulation  are  shown  in  Fig.  16(b).  Note 
that  extraneous  diffraction  outputs  are  visible  in  rows  -4,  -1, 1,  and  4.  Angular  width  Bragg 
degeneracy  alone  is  responsible  for  the  outputs  in  rows  -1  and  1,  while  a  combination  of  angular 
width  and  normal  (Bragg-matched)  Bragg  degeneracy  causes  the  outputs  in  rows  -4  and  4.  Since 
the  outputs  in  these  rows  are  not  detected  they  naturally  do  not  directly  affect  the  fidelity  of  the 
detected  outputs. 

The  detected  outputs  are  shown  in  Fig.  16(c).  A  visual  comparison  with  the  ideal  outputs  in 
Fig.  16(a)  shows  significantly  improved  fidelity  over  the  previous  cases.  This  is  verified  by 
comparing  the  RMS  error  of  the  normalized  outputs,  which  for  Case  C  is  0.0129.  This  is  over  an 
order  of  magnitude  lower  than  the  errors  obtained  for  Cases  A  and  B.  Selection  of  a  pixel 
geometry  that  eliminates  all  forms  of  Bragg  degeneracy  among  the  desired  interconnections  thus 
significantly  improves  the  holographic  interconnection  system’s  reconstruction  fidelity.  The 
unequal  row  separation  rule  allows  the  design  of  such  pixel  sampling  grids. 

In  summary,  we  have  demonstrated  a  previously  unrecognized  form  of  Bragg  degeneracy 
that  is  due  to  the  angular  width  of  the  Bragg  diffraction  responses  of  volume  holographic  gratings. 
As  shown  by  numerical  simulation,  this  angular  width  Bragg  degeneracy  can  degrade 
interconnection  fidelity  as  much  as  normal  (Bragg-matched)  Bragg  degeneracy.  Use  of  the  no 
rectangle  rule  to  design  fractal  sampling  grids  does  not  guarantee  the  avoidance  of  angular  width 
Bragg  degeneracy  among  the  desired  interconnections  when  implementing  a  given  interconnection 
system.  However,  all  forms  of  Bragg  degeneracy  among  the  desired  interconnections  can  be 
avoided  if  the  unequal  row  separation  rule  is  used  in  the  design  of  the  fractal  sampling  grids.  This 
is  therefore  the  preferred  design  rule  to  use  with  holographic  interconnection  systems  in  which 
Bragg  degeneracy  can  occur. 

Having  explored  some  of  the  practical  aspects  of  angular  width  Bragg  degeneracy,  we  turn  now  to 
a  brief  discussion  of  a  more  fundamental  issue;  namely,  does  this  form  of  Bragg  degeneracy 
impose  any  additional  limitations  on  the  interconnection  capacity  of  volume  holograms?  The 
answer  at  this  point  appears  to  be  no.  As  discussed  in  Section  3,  one  can  calculate  the  amount  of 
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Fig.  16.  Simulation  results  for  Case  C  (see  text). 


grating  k-space  that  is  accessible  from  two  finite  pixel  planes.  Based  on  the  dimensions  of  the 
holographic  medium,  a  given  grating  occupies  a  fixed  volume  of  this  k-space.  The  size  of  each 
grating's  k-space  volume  corresponds  inversely  to  the  angular  width  of  the  grating's  Bragg 
response.  One  can  calculate  the  number  of  interconnection  gratings  that  can  be  stored  in  the 
accessible  volume  of  k-space  by  simply  dividing  by  the  volume  occupied  by  an  individual  grating. 
Angular  width  Bragg  degeneracy  does  not  alter  this  fundamental  fact.  Rather,  it  relates  to  how 
specific  sampling  grids  are  chosen  such  that  the  resultant  interconnection  gratings  do  not  overlap  in 
the  accessible  grating  k-space.  Awareness  of  angular  width  Bragg  degeneracy  is  thus  imponant  in 
the  implementation  of  an  interconnection  geometry,  but  does  not  affect  the  fundamental  number  of 
interconnections  that  the  geometry  can  intrinsically  access. 

4.3  BEAM  COUPLING 

A  second  major  source  of  distortion  in  holograms  is  energy  transfer  between  Bragg- 
matched  beams  via  two-wave  mixing.  This  occurs  because  when  an  input  beam  is  Bragg-matched 
to  a  grating,  the  reconstructed  beam  is  automatically  Bragg-matched.  This  beam  can  then  itself 
read  out  the  same  grating  and  reconstruct  the  input  beam.  If  the  gain-length  product  of  the  grating 
is  sufficiently  high,  energy  transfer  or  beam  coupling  can  occur  between  the  beams,  resulting  in 
distortions.  Several  workers  have  analyzed  the  effects  of  beam  coupling  on  holographic  image 
quality.  *^’*5  They  found  that  distortion  increases  rapidly  as  the  gain-length  product  of  the  grating 
increases,  as  illustrated  in  Fig.  17.  Therefore,  a  straightforward  technique  for  reducing  beam¬ 
coupling  distortion  is  to  reduce  the  gain-length  product.  However,  this  also  results  in  low 
diffraction  efficiencies  and  low  signal-to-noise  ratios. 
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Figure  17. 


Cross-coupling  error  in  superimposed  volume  gratings  as  a  function  of  grating- 
strength  X  interaction  length. 
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SECTION  5 

CASCADED-GRATING  HOLOGRAPHY 


5.1  CASCADED-GRATING  WEIGHT  STORAGE 

We  have  developed  a  holographic  recording  technique  called  cascaded  grating  holography 
(CGH)  which  greatly  reduces  the  distortions  discussed  in  the  previous  section.  The  essence  of  the 
CGH  idea  is  to  use  a  set  of  angularly  and  spatially  multiplexed  gratings  to  store  each  weight  rather 
than  a  single  grating.  By  forcing  a  light  beam  to  match  the  Bragg  condition  at  each  of  a  cascaded 
series  of  spatially  and  angularly  distributed  gratings  (Fig.  18),  crosstalk  due  to  Bragg  degeneracy 
is  greatly  reduced.  From  the  k-space  construction  of  Fig.  19,  it  is  clear  that  two  gratings  in  series 
will  connect  only  a  single  input/output  pair  of  beams  via  an  intermediary  diffracted  beam.  All  other 
beam  triplets  (input,  intermediaiy,  and  output  beams)  will  not  be  able  to  match  the  Bragg 
conditions  at  both  gradngs  because  the  intermediary  diffracted  beam  will  not  lay  on  the  Bragg 
degeneracy  cone  of  the  second  grating.  An  undesired  beam  on  the  Bragg  degeneracy  cone  of  one 
grating  is  therefore  rejected  by  the  remaining  gratings.  This  allows  the  neurons  to  be  arranged  in 
arbitrary  patterns  on  the  SLM,  increasing  the  storage  capacity  (since  all  pixels  can  be  used)  and 
throughput. 

We  have  investigated  two  techniques  for  generating  such  multiple-grating  connection 
weights  in  photoreffactive  materials:  the  self-pumped  phase  conjugate  mirror  (PCM)  and  forward- 
scattering  beam  fanning.  A  self-pumped  PCM^^  generates  the  "phase  conjugate"  or  time-reversed 
version  of  an  image-bearing  coherent  input  beam.'^  In  the  early  stages  of  our  research  we 
conjectured  that  the  temporal  evolution  of  the  phase  conjugation  process  in  self-pumped 
photorefractive  PCMs,  which  include  dynamic  multiple-wave  mixing  terms,  will  be  similar  to  the 
single  grating  case  in  that  the  amplitude  diffraction  efficiency  of  the  conjugate  beams  will  increase 
with  the  outer-product  of  the  writing  beam  amplitudes,  thereby  emulating  the  Hebbian  learning 
rule.  In  addition,  multiple  cascaded  gratings  are  generated  which  should  eliminate  Bragg 
degeneracy.  This  in  turn  implies  that  the  multiple  grating  per  weight  approach  can  be  implemented 
using  self-pumped  PCMs  and  used  to  form  neural  networks  which  obey  Hebbian  learning  rules. 
Moreover,  the  observed  distributions  of  beams  within  a  self-pumped  PCM,  which  are  determined 
by  the  high  coupling  gain  of  BaTi03,  scattering  centers,  reflections  from  crystal  faces,  and  the 
geometry  of  the  PCM  configuration, * 8  suggest  that  light  beams  from  the  entire  input  plane  mix  in 
the  crystal.  This  results  in  the  global  interconnection  of  input  pixels  by  a  self-pumped  PCM, 
especially  if  the  PCM  is  in  the  Fourier  plane  of  the  input  spatial  light  modulator.  since  a  beam 
from  one  pixel  must  diffract  from  a  large  set  of  spatially  distributed  gratings  in  order  to  form  the 
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SINGLE-GRATING  WEIGHTED 
CONNECTION  BETWEEN  I  AND  J 
(SUFFERS  FROM  BRAGG  DEGENERACY) 
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MULTIPLE-GRATING  WEIGHTED 
CONNECTION  BETWEEN  BEAMS  I  AND  J 
(NO  BRAGG  DEGENERACY) 


Fig.  18.  Optical  connections  made  by  scattering  from  multiple  cascaded 
cross-talk  due  to  Bragg  degeneracy  and  allow  full  utilization  of 
planes. 


gratings  reduce 
input  and  output 


33 


Fig.  19.  Ttaree-dimensionai  k<space  diagram  for  satisfaction  of  Bragg  conditions  at 
two  gratings  simultaneously.  Only  one  triplet  of  light  beams  (colored 
white)  satisfy  the  Bragg  conditions  of  both  gratings  simultaneously. 

conjugate  of  a  second  pixel,  the  Bragg  degeneracy  crosstalk  should  be  low  according  to  the 
arguments  presented  previously. 

We  perfonned  a  series  of  experiments  to  test  these  conjectures  for  the  grating  selectivity 
and  global  connectivity  of  the  PCM  approach.  The  setup  illustrated  in  Fig.  20  was  used  to  test  the 
vertical  and  horizontal  angular  selectivity  of  a  self-pumped  PCM.  A  BaTi03  crystal  was  placed  in 
the  back  focal  plane  of  a  lens  in  the  correct  orientation  for  self-pumped  phase  conjugation.  To 
measure  the  horizontal  selectivity,  the  input  laser  beam  passed  through  a  Ronchi  ruling  oriented 
with  the  stripes  vertical.  The  beam  was  focused  into  the  PCM.  The  phase  conjugate  or  "time- 
rcvcrscd"  beam  was  then  reflected  from  a  beam  splitter  and  focused  into  a  photodiode  detector. 
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Fig.  20.  Experimental  configuration  for  testing  Bragg  .selectivity  of  self-pumped  phase  conjugate 
mirror  in  readout  phase. 


After  steady  state  was  reached,  the  laser  power  was  lowered  to  reduce  the  rate  of  formation  of  new 
gratings  and  a  recording  was  made  of  the  conjugate  signal  as  the  Ronchi  ruling  was  translated 
horizontally.  Assuming  sufficient  angular  selectivity,  the  conjugate  signal  should  oscillate  as  the 
ruling  is  translated.  Specifically,  for  zero  crosstalk  the  output  vs  time  should  be  the  autocorrelation 
function  of  the  Ronchi  ruling.  The  smaller  the  depth  of  the  oscillation,  the  greater  the  crosstalk. 
Similarly,  to  measure  vertical  selectivity  the  Ronchi  ruling  was  rotated  90°  and  translated  vertically. 
The  experimental  results  are  shown  in  Fig.  21.  For  this  case  the  tested  angular  separation  was 
1.7x10'3  radians,  which  is  sufficient  for  10^  to  10^  neurons  assuming  reasonable  lens  focal 
lengths  and  SLM  dimensions.  As  can  be  seen  in  the  figure,  the  extinction  in  both  directions  is 
complete,  proving  that  Bragg  degeneracy  was  eliminated. 

Global  connectivity  was  demonstrated  by  reading  out  a  complete  stored  image  using  less 
than  3%  of  the  original  image  as  input.  In  short  we  found  that  holograms  with  good  image  quality, 
minimal  Bragg  degeneracy,  and  global  connectivity  could  be  stored  in  the  PCM  configuration. 

We  repeated  these  experiments  using  a  mutually-pumped  PCM  and  obtained  similar  results. 

Despite  the  positive  results  we  obtained  for  the  PCM  method  in  terms  of  Bragg  degeneracy 
and  global  connectivity,  we  found  that  we  could  not  implement  neural  networks.  Investigating 
further,  we  found  that  the  dynamic  characteristics  of  self-  and  mutually-pumped  photoreffactive 
PCMs  resulted  in  inter-hologram  aosstalk  when  multiple  exposures  were  recorded  in  the  crystal. 

In  particular,  we  found  that  previously  recorded  holograms  influenced  the  formation  of  new 
holograms  in  such  a  way  that  extraneous  connections  were  made.  For  example,  if  two  orthogonal 
images  A  and  B  (e.g.,  with  no  common  pixels)  were  sequentially  recorded  in  the  PCM, 
subsequent  readout  with  either  image  would  recall  the  superposed  phase  conjugates  of  both 
images. 

This  effect  is  illustrated  in  Fig.  22  which  plots  the  conjugate  signals  for  the  two  images  as 
the  input  was  repeatedly  switched  between  A  and  B.  Note  that  when  the  input  is  switch  to  B  from 
A,  both  A  and  B*  appear.  (As  the  exposure  continues  B  eventually  decreases  to  zero  as  B's 
gratings  are  erased.)  This  effect  is  analogous  to  the  "hologram  sharing"  effect  in  mutually-pumped 
PCMs  in  which  one  hologram  formed  by  an  input  beam  affects  the  formation  of  a  second  hologram 
formed  by  a  second  mutually  incoherent  input  beam.^J  Gratings  which  can  be  "shared"  between 
two  holograms  have  higher  gains  over  other  gratings  and  tend  to  be  enhanced.  This  in  fact  is  the 
principle  behind  mutually-pumped  PCMs  in  which  two  mutually  incoherent  beams  can  read  out 
each  other's  conjugate  beam.  In  the  case  considered  here  the  exposures  of  the  two  holograms  are 
separated  in  time  rather  than  simultaneous,  but  the  principle  is  the  same. 

The  hologram  sharing  effect  results  in  very  strong  inter-hologram  crosstalk  and  is 
detrimental  to  optical  neural  networks.  This  led  us  to  consider  a  second  non-phase-conjugate 
method  which  uses  the  same  multi-grating  principle  to  avoid  Bragg  degeneracy  but  does  not  suffer 
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Fig.  22,  Time-sequenced  exposure  of  self-pumped  phase  conjugate  mirror  with  two 

orthogonal  inputs  A  and  B.  Presence  of  both  conjugates  A*  and  B  simultaneously 
is  due  to  effects  of  hologram  sharing  and  indicates  inter-hologram  crosstalk. 

from  hologram  sharing.  This  method  is  based  on  the  observed  phenomena  of  beam  fanning  in 
photorefractive  crystals. 

"Beam  fanning"  is  a  well-known  effect  in  high  gain  photorefractive  crystals  in  which  an 
input  beam  is  initially  scattered  by  small  inhomogeneities  in  the  crystal,  resulting  in  low  amplitude 
scattered  optical  noise.  The  noise  beams  then  interfere  with  the  original  input  beam  and  write 
gratings.  Scattering  of  the  input  beam  by  these  gratings  selectively  amplifies  some  of  the  noise 
beams  by  the  process  of  energy  transfer  in  photorefractive  two-wave  mixing.  The  amplified  beams 
then  write  new  gratings  and  the  process  cascades.  Which  beams  are  amplified  most  is  determined 
by  the  electrooptic  tensor  of  the  crystal  and  the  orientation  of  the  input  beam.  The  net  effect  is  that 
the  input  beam  literally  fans  in  the  crystal  as  it  writes  a  set  of  spatially  and  angularly  dist.ibuted 
gratings. 

As  shown  in  Fig.  23,  this  fanned  light  can  be  used  as  a  reference  beam  when  it  is  interfered 
with  a  second,  unfanned  object  beam  to  form  a  holographic  connection  which  suffers  neither  from 
Bragg  degeneracy  (because  multiple  cascaded  gratings  store  each  connection)  nor  from  hologram 
sharing  (because  the  conjugate  beam  is  not  used  for  readout).  An  unfanned  object  beam  is  used 
because  object  beam  fanning  would  degrade  the  quality  of  the  reconstructed  object  image.  Fanning 
can  be  controlled  so  that  the  reference  beam  fans  and  the  object  beam  doesn't  by  adjusting  the 
orientation  of  the  beams  relative  to  the  crystal.  The  fanning  process  generates  high  gain-length 
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Fig.  23.  Recording  of  connection  using  fanned  reference  beam. 
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product  "fanning  gratings"  which  divide  each  reference  beam  into  a  set  of  beams  at  different 
orientations  and  locations.  During  recording  the  object  beams  form  a  set  of  "signal  gratings'  in 
which  the  connection  weights  are  stored.  Both  the  signal  and  fanning  gratings  are  angularly  and 
spatially  multiplexed.  Upon  readout  each  reference  beam  must  match  the  Bragg  condition  at  a 
multitude  of  fanning  gratings,  which  breaks  the  Bragg  degeneracy.  In  addition,  the  beamlets 
reconstructed  by  the  signal  gratings  are  all  in  phase  and  sum  coherently,  hence  the  aggregate 
diffraction  efficiency  can  be  high  even  though  the  diffraction  efficiency  of  any  individual  signal 
grating  is  small.  In  addition,  the  low  gain-length  product  of  the  individual  signal  gratings  greath' 
reduces  distortions  due  to  beam  coupling. 

We  have  applied  crosstalk  tests  to  the  fanning  method  as  described  above  for  the  PCM 
method  and  found  that  the  Bragg  degeneracy  advantages  of  PCM  are  maintained  but  hologram 
sharing  is  eliminated.  Recordings  of  holograms  in  a  fanning  crystal  of  BaTiOs  do  not  show  Bragg 
degeneracy,  hologram  sharing  crosstalk,  or  distonions  due  to  beam  coupling.  An  example  of 
holographic  recording  using  an  arbitrary  2-D  gray-scale  reference  image  is  shown  in  Fig.  24. 
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Fig.  24.  Experimental  demonstration  of  recorded  image  quality  using  fanned  reference  beam 
and  2-D  non-subsampled  reference,  (a)  Object  and  reference  used  for  recording. 
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Fig.  24.  ExperimenUil  demonstration  of  recorded  image  quality  using  fanned  reference  beam 
and  2>D  non-subsampled  reference,  (b)  Original  object  for  comparison. 

(c)  Hologram  of  object  reconstructed  using  50%  of  original  gray>scale  reference. 
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The  object  and  reference  both  consisted  of  combinations  of  Pentagon  and  woman  images  displayed 
on  a  LCTV  with  30,000  gray-scale  pixels  (90,000  pixels  if  the  individual  red,  green,  and  blue 
pixels  of  the  color  LCTV  are  counted).  The  original  object  (showing  the  system’s  optical  quality) 
and  reconstructed  hologram  are  shown  in  Figs.  24(b)  and  24(c).  Note  that  only  50%  of  the 
original  reference  was  used  in  reading  out  the  hologram.  The  reconstructed  image  is  virtually 
identical  to  the  original,  as  shown  in  Fig.  25  which  is  a  3-D  plot  of  a  detail  in  the  original  and 
holographic  images.  Upon  magnification  of  the  reconstructed  hologram,  each  of  the  45,000  LCTV 
pixels  in  the  original  object  was  clearly  discernible. 

A  measurement  of  the  holographic  fan-out  is  illustrated  in  Fig.  26.  In  this  experiment  a 
uniform  (all  pixels  on)  reference  was  used  to  record  the  hologram.  During  readout  an  opaque 
screen  with  an  adjustable  small  round  aperture  was  translated  in  front  of  the  reference  plane, 
allowing  us  to  measure  the  weight  vector  connected  to  that  portion  of  the  reference.  In  this 
experiment  the  aperture  area  was  0.0091  of  the  reference,  corresponding  to  a  fanout  of  1 10. 
(Measurements  of  larger  fanouts  were  limited  by  the  sensitivity  of  our  CCD  camera.)  It  is  evident 
from  the  photograph  that  the  entire  object  was  reconstructed  when  read  out  with  a  small  firaction  of 
the  reference,  demonstrating  global  connectivity. 

We  also  measured  the  time  required  to  write  holograms  to  saturation  using  fanned  reference 
beams.  We  used  a  two  slotted  wheels  on  a  common  rotating  shaft  to  periodically  switch  off  the 
object  beam  in  order  to  measure  the  buildup  in  diffraction  efficiency  as  the  hologram  was  being 
written.  The  results  are  shown  in  Fig.  27  for  a  crystal  of  BaTi03  which  was  cut  at  45®  to  the 
c-axis.  This  crystallographic  orientation  maximizes  the  effective  electrooptic  coefficient.  In  this 
case  the  wavelength  was  in  the  green  (514  nm)  and  the  total  optical  power  incident  on  the  crystal 
was  17.5  mw.  The  time  required  to  write  to  saturation  was  25  msec.  Weights  are  typically 
adjusted  by  small  amounts  during  the  learning  phase,  therefore  all  the  weights  connecting  two 
neuron  layers  can  be  adjusted  in  less  than  I  msec.  This  would  correspond  to  learning  rates  of 
greater  than  10^  *  connection  updates  per  sec  if  each  layer  contained  10^  neurons.  Since  the 
photorefractive  time  constant  is  inversely  proponional  to  the  optical  power,  the  update  rate  can  be 
further  increased  by  raising  the  laser  power. 

Improvements  in  packaging  size  and  cost  would  be  obtained  if  a  laser  diode  light  source 
could  be  used.  Currently  available  BaTi03  is  less  sensitive  at  the  infrared  laser  diode  wavelengths 
and  the  photorefractive  gain  is  less.  Nevertheless,  we  were  successful  in  demonstrating  cascaded- 
grating  holography  in  45°  BaTi03  using  a  laser  diode  light  source  operating  at  830  nm.  We  used  a 
150  mw  single  mode  laser  diode  from  SpectraDiode  Labs.  The  total  optical  power  at  the  crystal 
was  3 1  mw.  As  shown  in  Fig.  28,  we  could  resolve  a  holographic  image  with  exposure  times  as 
short  as  100  msec.  Nevertheless,  due  to  the  much  reduced  sensitivity  of  BaTi03  at  830  nm,  the 
time  constant  for  hologram  writing  was  much  longer  than  at  514  nm.  Referring  to  Fig.  29,  we 
measured  a  saturation  time  of  10  sec  for  an  incident  power  of  45  mw  at  830  nm.  Thus  more  work 
is  needed  to  improve  the  infrared  sensitivity  of  the  recording  medium  for  use  with  laser  diodes. 
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ORIGINAL  OBJECT  DETAIL 


HOLOGRAM  RECONSTRUCTED 
USING  50%  OF  GRAY-SCALE  REFERENCE 


Fig.  25.  3-D  plots  showing  detail  of  Pentagon  images  in  Fig.  11.  (a)  Original  object. 

(b)  Hologram  of  object  reconstructed  using  50%  of  original  gray-scale  reference. 
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(a< 


Fig.  26.  Experimental  demonstration  of  fanout  and  global  connectivity  using  fanned 

reference  beam,  (a)  Object  and  reference  used  for  recording,  (b)  Hologram  of  object 
using  100%  of  original  reference,  (c)  Hologram  reconstructed  using  0.9%  of 
original  reference  showing  fanout  of  1:110  and  global  connectivity.  Demonstration 
of  larger  fanout  was  limited  by  CCD  sensitivity. 
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45°-CUT  BaTi03  (210-H),  >.=  514  nm,  Pr  =  8.5  mW,  Pq  =  9.0  mW 


M  I 


25  ms 


Fig.  27.  Hologram  writing  time  (fanned  reference). 

Alternatively,  compact  solid  state  lasers  emitting  in  the  green  could  be  used  instead.  The  impact  of 
these  options  on  package  design  are  discussed  in  the  packaging  section  of  this  repon. 

5.2  HOLOGRAM  SUPERPOSITION 

The  learning  pha.se  of  neural  network  models  involves  many  parallel  weight  adjustments  in 
response  to  an  internally  or  externally  generated  error  signal.  In  holographic  optical  neural 
networks  this  function  is  performed  through  the  superposition  of  many  exposures  of  the 
photorefractive  crystal  in  which  the  strengths  of  individual  gratings  are  strengthened  or  weakened. 
Ideally,  the  holographic  process  should  not  diston  the  linear  superposition  of  weight  update 
vectors.  In  the  following  we  derive  necessary  conditions  for  linearity  using  a  simple  exponential 
decay  model.  We  also  point  out  that  weight  erasure  can  diston  the  neural  network  learning  rule  if 
precautions  are  not  taken. 
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Fig  28.  IR  hologram  in  45°>cut  BaTi03  (210>H)  (fanned  uniform  reference).  X  =  830  nm, 
Pr  =  21  mW,  Po  =  mW. 
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45°-CUT  BaTi03  (210-H),  A.=  830  nm,  Pr  =  21  mW,  Pq  =  24  mW 


-  10  sec 


Fig.  29.  Hologram  writing  time  (fanned  reference). 

Following  Valley22,  we  assume  a  quasi-CW  approximation  for  the  photoreffactive 
equations  developed  by  Kukhtarev  et  al..23  The  coupled  equations  describing  the  time  and  space 
evolution  of  two  plane  waves  with  amplitudes  Ap  and  Ag  are  then  given  by 


Bx 


P 


dA^  _  ik  A  c* 
-)  ^  P 

^P 


dx^  In, 


dt  X  T/q 
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where 


and 


T=tdi 


The  quantities  r^ff,  e,  Nj),  N/^,  a,  p.,  ng,  np,  and  tR  are  material  constants,  k=27t/X,  and  kg  is 

the  wave  number  of  the  space  charge  field  grating.  The  above  equations  are  applicable  to  crystals 
in  which  one  charge  carrier  dominates.  The  effects  of  absorption  on  the  spatial  derivatives  of  Ap 
and  As  are  neglected. 

In  this  simplified  model  the  response  time  T  is  inversely  proponional  to  the  total  light 
intensity  Iq.  Let  us  define  a  normalized  response  time  X]=IoT  which  is  independent  of  the  total 

light  intensity.  The  equation  for  the  time  evolution  of  the  grating  space  charge  field  then  can  be 
written  as 

dE  _  Iq  ^  ^ 
dt  Tj  T) 


The  saturation  value  of  the  space  charge  field  is 


The  incremental  change  to  the  space  charge  field  is  given  by 


SE  =  -^ESt*‘^^AXSl 


The  second  term  is  the  desired  Hebbian  contribution  to  E.  It  is  modified  by  the  first  term 
which  represents  weight  erasure.  Weight  erasure  reduces  the  magnitude  of  a  weight  vector  that  is 
not  currently  being  updated  (Ap  and/or  As  equal  to  0)  but  not  its  orientation  because  5E  is  then 
proportional  to  E.  The  orientadon  of  a  weight  vector  is  affected  by  weight  erasure  if  Ap  and  Ag  are 
nonzero.  The  erasure  term  distorts  gradient  descent  neural  network  models  such  as 
backpropagation  which  are  sensitive  to  both  orientation  and  magnitude  of  5E,  but  has  little  effect 
on  other  neural  networks  such  as  the  Perception.  If  the  output  values  of  a  Perceptron  are  (1,-1) 
and  the  threshold  is  0,  then  weight  decay  will  not  affect  the  Perceptron  output  because  the  linear 
discriminant  depends  only  on  the  orientation  of  the  weight  vector. 

The  above  equation  can  be  rewritten  using  neural  network  nomenclature.  Each  weight  wjj 

connecting  neurons  i  and  j  is  then  described  by  an  equation  of  the  form 


w. 


^iJ 


where  riij=iEscAjAj*/xvv  is  the  updating  term  proportional  to  the  product  of  the  amplitudes  of 
writing  beams  i  and  j.  and  tg  are  the  writing  and  erasing  time  constants,  respectively.  The 
weight  saturation  value  is  Wsat=Tlijtvy^.  In  the  following  we  will  assume  Tg  and  are  constant 
(equivalent  to  assuming  equal  light  intensities  in  all  updates).  We  will  also  suppress  the  neuron 
indices  i  and  j. 

We  consider  multi-epoch  recording  (sometimes  called  incremental  recording)  in  which  each 
hologram  is  exposed  for  a  shon  time  and  the  set  of  holograms  is  cycled  through  many  times.^^ 
This  recording  method  is  appropriate  for  neural  networks  in  which  the  number  of  required 
exposures  is  not  known  be^''rehand.  We  will  also  assume  non-orthogonal  holograms  in  which 
each  connection  weight  may  be  updated  an  arbitrary  number  of  times  during  learning.  Our 
objective  is  to  derive  conditions  for  the  linear  superposition  of  exposures  in  multi-epoch  recording. 

In  the  first  epoch  after  the  n-th  expxisure  the  solution  to  Eq.  (1)  is 
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in  which  subscripts  now  refer  to  exposures  and  t  is  the  time  elapsed  since  the  stan  of  the  n-th 
exposure.  Equation  (2)  expresses  the  current  weight  values  in  terms  of  the  weight  value  at  the  end 
of  the  previous  exposure.  It  can  be  solved  recursively  to  yield 

+  %(l  )”f 

p=0 


where  WQ  is  the  initial  weight  value  and  T  is  the  exposure  time  for  each  update  (assumed  constant 

in  multi-epoch  recording).  Thus  at  the  end  of  the  first  epoch  the  weight  value  is  an  exponentially 
weighted  sum  of  the  updates  T)(,.  Now  assume  w(pO  and  sum  over  Q  epochs,  each  of  which 

consists  of  N  holographic  exposures  (one  for  each  exemplar).  After  the  Q-th  epoch, 

=  %(i it 

<7=1  p=0 


where  q  is  the  epoch  index.  Now,  if  the  order  of  exemplars  is  the  same  in  each  epoch,  then  by 
definition  q=T|jsj.p  ^^d  the  summation  over  q  can  be  done  analytically: 


'*^1  - — zTjTrrr 


In  order  to  minimize  distortions  due  to  photorefiractive  erasure,  wq  should  be  forced  to  be 
proportional  to  T)  averaged  over  the  N  exemplars.  This  is  the  case  if  T«xe/N  : 


if  NT«te 


The  value  of  the  weight  (light  amplitude  diffraction  efficiency)  is  maximized  if  in  addition  the 
number  of  epochs  satisfies  Q»xe/NT  : 


where 


)??  if  NT<<XE<<QNT 


n=l 


N 


50 


Thus  under  these  conditions  the  weight  is  simply  proportional  to  the  linear  average  of  the  update 
values,  which  is  desirable  for  neural  network  models.  The  diffraction  efficiency,  which  is 
proponional  to  Iwq|2,  decreases  as  l/N^. 

A  demonstration  of  multi-epoch  recording  of  superimposed  holograms  using  fanned 
reference  beams  is  illustrated  in  Fig.  30.  In  this  case  the  objects  were  rotated  versions  of  a  gray¬ 
scale  woman  image  and  the  references  were  orthogonal  grid  patterns.  As  shown  in  Figs.  30(b) 
and  30(c),  each  of  the  superimposed  holograms  could  be  read  out  with  very  little  crosstalk  between 
holograms  and  with  approximately  equal  diffraction  efficiencies.  Fig.  31  is  a  3-D  plot  of  relative 
diffraction  efficiency  measured  for  30  superimposed  holograms  recorded  using  multi-epoch 
exposures.  The  figure  plots  relative  diffraction  efficiency  of  each  hologram  when  read  out  by  each 
of  the  references.  The  objects  and  references  were  non-orthogonal  random  patterns.  Due  to  the 
non-orthogonality  of  the  patterns,  one  expects  approximately  25%  crosstalk,  which  agrees  with 
experiment.  The  variations  in  diffraction  efficiency  along  the  diagonal  can  also  be  attributed  to  the 
random  nature  of  the  patterns. 


9129-06-41 


OBJECT  (a)  REFERENCE 


Fig.  30.  Experimental  demonstration  of  hologram  superposition  using  fanned  reference 
beams,  (a)  One  of  the  object-reference  pairs  used  for  recording.  References  were 
mutually  orthogonal.  Objects  were  rotated  versions  of  the  woman  image. 
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30  Holograms  Superimposed  Using  Random  Reference  Patterns 


Fig.  31.  Readout  of  30  holograms  exposed  using  multi-epoch  recording  and  random  reference 
patterns. 

An  interesting  result  is  obtained  if  we  plot  the  mean  hologram  diffraction  efficiency  against 
the  number  of  recorded  holograms  on  a  log-log  plot.  If  multiple  holograms  are  record^"* 
coherently  in  a  single  exposure  then  we  would  expect  the  diffraction  efficiency  per  hologram  to 
decrease  as  1/N,  where  N  is  the  number  of  holograms.  If  the  N  holograms  are  recorued  in  N  time- 
sequenced  exposures  then  we  would  expect  the  diffraction  efficiency  to  decrease  as  l/N^  due  to 
grating  erasure,  as  discussed  above.  Yet  when  we  plot  the  experimental  results  for  time-sequenced 
recording  using  fanned  references,  the  data  fall  between  he  l/N^  and  1/N  lines,  as  shown  in 
Fig.  32.  We  conjecture  that  this  may  be  due  to  the  spatially-multiplexed  nature  of  the  fanning 
gratings  and  the  associated  spreading  and  filamentation  of  the  reference  beam.  Light  from  a 
particular  reference  pixel  is  directed  by  the  fanning  gratings  to  the  signal  gratings  which  connect  it 
with  object  pixels.  Reference  and  signal  gratings  which  are  not  associated  with  that  reference  pixel 
tend  to  be  bypassed,  therefore  each  grating  lends  to  be  erased  only  by  light  beams  which  are 
connected  by  that  grating.  The  gratings  arc  therefore  erased  less  than  we  would  expect  from  the 
above  analysis  which  assumes  complete  overlap  between  all  of  the  oeams.  This  improved 
efficiency  performance  may  be  an  additional  advantage  of  cascaded-grating  holography. 
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DifTraction  efficiency  of  individual  holograms  normalized  by  composite  hologram  efficiency  and 
plotted  versus  number  of  superimposed  holograms. 


SECTION  6 


OPTICAL  NEUROCOMPUTER  DESIGN 
AND  CONSTRUCTION 

6.1  OPTICAL  REPRESENTATION  OF  NEURAL  NETWORKS 

Most  neural  network  models  require  weights,  error  signals,  and  neuron  outputs  to  assume 
both  positive  and  negative  values.  Bipolar  error  signals  and  weights  are  necessary  even  if  the 
neuron  response  function  saturates  at  1  and  0  in  order  to  both  rectify  wrong  responses  and 
reinforce  correct  responses  without  saturating  the  outputs.  Therefore,  means  must  be  provided  in 
an  ONN  for  bipolar  inputs  and  outputs.  Holographic  ONNs  can  use  coherent  or  incoherent 
methods  for  representing  negative  numbers.  In  coherent  approaches  direct  phase  modulation  of 
light  is  used  to  shift  the  phase  of  written  gratings.  The  phase  of  diffracted  light  beams  is  measured 
by  mixing  with  a  reference  beam  and  using  interferometric  detection.  Interferometric  detection  has 
potential  benefits  in  terms  of  increased  dynamic  range,  but  it  also  has  many  practical  difficulties. 
The  phase  shifting  of  the  input  must  be  done  with  great  uniformity  across  the  entire  SLM  input. 
(Although  this  can  be  accomplished  using  Stokes'  principle.^)  More  problematic  is  the 
interferometic  detection  at  the  output  detector  array.  It  is  difficult  to  maintain  phase  uniformity 
across  the  entire  output  detector  array.  The  system  also  becomes  very  vibration  sensitive.  Finally, 
many  presently  available  SLMs  have  an  amplitude-dependent  phase  response  which  makes 
independent  control  of  phase  and  amplitude  impossible  without  using  an  additional  compensating 
phase-only  SLM,  which  would  greatly  increase  alignment  difficulties  and  system  complexity. 

In  order  to  avoid  these  problems,  we  represent  bipolar  inputs,  weights,  and  outputs  in  our 
SPONN  system  with  spatial  multiplexing  and  electronic  subtraction.  A  bipolar  input  yj  (which 
could  be  an  input  from  neuron  j  or  an  error  signal)  is  divided  into  two  nonnegative  quantities  yj''" 
and  yj'  where 


yj-^=yj  and  yj*=0  if  yj>=0 
yj'*'=0  and  yj'=lyjl  if  yj<0 


These  operations  are  performed  electronically  in  the  host  computer,  yj'*'  and  yj'  are  then  written  to 
two  spatially-separated  SLM  pixels  in  the  reference  half  of  the  optical  input  plane.  For  the  writing 
or  weight  adjustment  phase  of  a  neural  network  algorithm,  two  similar  nonnegative  quantities  are 
written  to  two  SLM  pixels  in  the  object  section  of  the  input  plane  which  represent  a  bipolar  error 
signal  in  neuron  i.  The  crystal  is  then  exposed  with  the  object  and  reference  beams,  forming  four 
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constant-phase  weights  Wy"^,  wjj'*",  Wy""*',  and  Wy"  in  which  a  bipolar  effective  weight 

connecting  neurons  i  and  j  is  encoded.  The  writing  phase  is  further  divided  into  two  subphases  in 
which  the  weights  are  adjusted  twice:  once  with  the  outer-product  ejyj  and  once  with  (-qX-yj). 

Although  these  two  exposures  are  the  same  as  far  as  the  effective  weight  increment  is  concerned, 
they  serve  to  ensure  that  the  cross-diagonal  weights  are  equal,  e.g.  Wy'*^=Wy"=Wy+  and  Wy"*"* 
=Wy"*'=Wy'.  The  significance  of  this  will  be  evident  below. 

The  bipolar  algorithm  in  the  readout  phase  is  illustrated  in  Fig.  33.  Input  yj  is  again  split 
into  positive  and  negative  parts  which  read  out  the  weights  in  the  hologram.  Square-law  detection 
of  the  diffracted  light  beams  is  performed  at  two  CCD  pixels,  forming  the  two  intermediate  terms 
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Fig.  33.  Algorithm  for  bipolar  representation  uf  weights  and  neuron  values. 
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(In  our  actual  system  many  CCD  pixels  are  used  for  each  output  so  that  pixel  averaging  as  well  as 
temporal  averaging  :an  be  used  to  reduce  noise.)  The  square-root  of  each  of  these  CCD  outputs  is 
formed  before  subtracting  them.  Taking  advantage  of  the  forced  equivalence  of  cross  diagonal 
weight  values,  the  final  output  is  given  by 
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which  represents  a  true  bipolar  output  with  bipolar  weights  and  inputs.  The  electronic  ponions  of 
the  algorithm  are  not  bottlenecks.  The  square-root  is  performed  at  video  rates  on  the  entire  video 
frame  using  a  lookup  table  in  the  output  image  processor  card  while  integer  subtraction  is  also 
performed  very  quickly.  Note  that  this  method  relies  on  the  fact  that  all  terms  within  each  of  the 
magnitude-squared  brackets  have  the  same  phase,  namely  that  of  the  object  beam  at  the  respective 
CCD  pixel. 

A  diagram  of  the  operation  of  a  single  neural  network  layer  in  the  SPONN  system  during 
readout  and  write  phases  is  shown  in  Fig.  34.  Shaded  components  denote  the  optical  portion  of 
the  system.  The  first  step  in  the  readout  phase  is  the  transfer  of  output  from  the  previous  neural 
layer  (stored  in  host  memory)  to  SLM  pixels  in  the  R  input  plane.  Each  bipolar  value  is  rirst 
converted  to  two  nonnegative  compxinents  as  described  above.  The  R  plane  pixels  then  form  the  R 
beam  which  reads  out  the  hologram,  performing  an  optical  matrix -vector  multiplication.  The 
output  is  detected  by  two  CCD  pixels  resulting  in  the  raw  outputs  d"*"  and  d'.  The  square  root  of 
each  output  is  taken  using  lookup  tables  on  the  output  frame  grabber  board  before  they  are 
subtracted  and  passed  through  the  neuron  response  function,  resulting  in  the  output  for  that  layer. 

The  host  computer  then  calculates  an  error  signal  e  according  to  the  particular  neural 
network  model  being  implemented.  The  e  is  usually  quantized  to  -i-l,  0,  or  -1  because  the  neural 
algorithm  requires  it  (as  in  the  Perceptron  or  quantized  backpropagation)  or  due  to  the  limitations  of 
the  input  SLM,  or  both.  In  our  experiments  we  have  used  both  a  Hughes  liquid  crystal  light  valve 
(LCLV)  and  a  Sony  active-matrix  liquid  crystal  TV  (LCTV).  The  LCLV  has  an  amplitude- 
dependent  phase  response  which  results  in  phase  errors  during  learning.  A  simple  way  to  avoid 
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Fig.  34.  Schematic  of  information  flow  in  one  layer  of  SPONN.  Shaded  lines  denote  optical  pathways. 


such  errors  is  to  restrict  the  SLM  pixel  values  to  1  and  0.  Our  bipolar  algorithm  then  allows  us  to 
represent  effective  values  of  1, 0,  and  -1. 

Each  quantized  error  value  is  then  written  to  two  SLM  pixels  in  the  O  plane  according  to  the 
bipolar  algorithm.  The  hologram  is  then  exposed  using  the  O  and  R  beams  (the  original  input 
pattern  is  still  on  the  R  beam),  implementing  a  weight  matrix  update  equal  to  the  outer-product 
between  the  e  and  input  pattern  vectors.  This  sequence  is  repeated  for  each  layer  in  the  network, 
each  exemplar,  and  each  training  epoch. 

6.2  EXPERIMENTAL  SETUP 

A  diagram  of  our  programmable  optical  neurocomputer  (SPONN  or  Stimulated 
Photorefractive  Optical  Neural  Network)  is  shown  in  Fig.  35.  In  assembling  this  system  our  goals 
were  hardware  simplicity  and  programmability.  SPONN  contains  a  single  photorefractive  crystal, 
SLM,  and  CCD  detector.  Multi-layer  networks  are  implemented  by  superimposing  different 
weight  layers  in  the  same  crystal.  A  plane  wave  readout  beam  from  an  argon  laser  at  514  nm  was 
spatially  modulated  by  the  SLM.  In  the  experiments  described  here  the  SLM  was  a  Sony  active- 
matrix  liquid  crystal  TV  (LCTV)  which  we  disassembled.  It  has  30,000  pixels  and  a  contrast  ratio 
of  2(X)  at  514  nm.  We  also  used  a  Hughes  Liquid  Crystal  Light  Valve  (LCLV)  in  our  early 
experiments.  (Although  the  LCXV  is  superior  to  the  LCTV  in  terms  of  resolution,  we  took  it  out 
of  the  system  due  to  its  more  complicated  and  bulky  addressing  circuitry,  its  reflection  mode  of 
operation,  and  the  slow  response  time  of  the  CdS  LCLVs  that  were  available  to  us.) 
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Fig.  35.  Stimulated  Photorefractive  Optical  Neural  Network  (SPONN)  experimental 
configuration. 
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The  input  plane  displayed  on  the  SLM  consisted  of  two  portions:  the  reference  (R)  and 
object  (O)  planes.  A  system  of  mirrors  separated  the  O  and  R  planes.  The  R  beam  entered  the 
BaTi03  crystal  at  an  angle  relative  to  the  c-axis  such  that  it  experienced  high  two-wave  mixing 

gain,  resulting  in  fanning.  The  O  beam  was  directed  at  a  low-gain  angle  and  did  not  fan.  (Fanning 
of  the  object  beam  would  have  degraded  the  optical  quality  of  reconstructed  images  since  the 
original  O  beam  would  be  distoned.)  Two  image  processing  cards  (VSKX)  and  FG1(X)  from 
Imaging  Technology,  Inc.)  were  installed  in  the  host  computer  (an  80386  PC).  The  input  image 
processor  was  dedicated  to  providing  video  input  to  the  SLM.  The  output  image  processor 
grabbed  video  from  the  CCD  camera  which  detected  the  optical  output  of  SPONN.  Pixels  in  the  O 
and  R  planes  represent  neurons  in  the  various  network  layers. 

After  a  hologram  was  written,  liquid  crystal  shutter  SI  was  closed  by  the  host  computer 
and  the  R  beam  read  out  the  hologram.  (Closing  SI  ensured  that  no  light  leakage  due  to  the  finite 
contrast  ratio  of  the  SLM  was  passed  through  from  the  O  input  plane  to  the  detector  during 
readout.)  The  CCD  camera  detected  the  holographic  reconstruction  of  the  O  plane.  Bragg 
degeneracy  was  greatly  reduced  due  to  the  fanning  of  the  R  beam  and  the  reconstructed  image 
quality  was  good,  as  explained  in  the  previous  sections. 

Several  measures  were  taken  to  reduce  noise  and  distortions.  In  order  to  reduce  spatial 
aliasing,  the  "optical  neurons"  were  oversampled  by  the  CCD  pixels.  An  automated  alignment 
program  was  used  to  pick  an  operator-selected  number  of  CCD  pixels  to  use  tor  each  output 
neuron.  Coarse  alignment  was  done  using  an  affine  coordinate  transformation  between  the  SLM 
input  and  CCD  output  planes  calculated  from  the  positions  of  three  points  and  their  corresponding 
images  in  the  two  planes.  Final  CCD  pixel  selections  were  made  by  sorting  and  picking  the 
brightest  ones  The  CCD  pixel  values  were  then  summed  in  order  to  reduce  detector  noise. 

Multiple  video  frames  could  also  be  summed  to  funher  reduce  noise.  In  addition,  nonlinearities  in 
the  SLM  transfer  function  were  measured  and  compensated  by  modifying  the  output  lookup  tables 
of  the  input  image  processor.  This  provided  us  with  greater  accuracy  and  control  of  the  neuron 
activation  function.  We  found  that  system  noise  was  dominated  by  fluctuations  in  the  laser  power. 
In  our  experiments  we  used  an  old  argon  ion  laser  with  relatively  poor  stability.  Without  pixel  or 
frame  averaging  we  measured  the  output  signal  relative  standard  deviation  to  be  1.9%.  By 
spatially  averaging  9  pixels  and  temporally  averaging  9  frames,  we  reduced  the  relative  standard 
deviation  of  the  noise  to  0.74%.  Future  SPONN  models  will  use  more  stable  lasers  and  feedback 
systems  to  reduce  the  noise  level. 

Computer-controlled  liquid  crystal  cells  LC-1  and  LC-2  were  used  to  control  the  optical 
power  during  the  hologram  writing  and  reading  phases.  During  writing,  LC-1  and  Si  were  both 
turned  on  (full  transmittance)  for  high  power  R  and  O  beams  and  quick  writing  times. 
Simultaneously,  LC-2  was  turned  off  so  that  the  CCD  camera  was  not  saturated  by  the  O  beam. 
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During  reading  LC-1  was  switched  to  low  so  that  the  gratings  were  not  quickly  erased  by  the  R 
beam.  S 1  was  closed  so  that  the  input  O  beam  did  not  interfere  with  the  reconstructed  O  beam  and 
LC-2  was  turned  fully  on  for  maximum  detector  sensitivity. 

The  topology  of  the  network  being  implemented  is  determined  by  the  organization  of  the 
input  plane.  As  shown  in  Fig.  36,  a  single  layer  network  with  only  forward  connections  is 
implemented  by  devoting  the  entire  R  plane  to  the  input  layer  of  the  network,  Lq.  Likewise,  the 
output  layer,  Lj,  occupies  the  endre  O  plane.  Turning  pixels  on  in  the  R  and  O  planes  forms 
connections  between  Lq  and  Lj.  Lq  then  reads  out  the  weights  connecting  the  R  and  O  planes. 
Multiple  layer  neural  networks  are  implemented  by  dividing  the  input  plane  into  sectors.  For 
example,  to  implement  a  net  with  an  input  layer  Lq,  a  hidden  layer  Lj,  and  an  output  layer  L2,  the 
input  plane  is  divided  into  four  sectors,  as  shown  in  Fig.  18.  Connections  between  Lq  and  Lj  are 
formed  by  writing  values  to  the  Lq  and  Lj  sectors  in  the  R  and  O  planes,  respectively.  Similarly, 
connections  between  Lj  and  Lj  are  formed  by  writing  values  to  the  Lj  and  L2  sectors  in  the  R  and 

O  planes,  respectively.  The  two  weight  layers  are  exposed  separately  to  avoid  unwanted  links 
between  Lq  and  L2.  To  read  out  the  network,  an  input  pattern  is  first  written  to  Lq  in  the  R  plane. 

The  optical  matrix- vector  product  is  then  detected  by  the  CCD  camera  at  Lj  in  the  O  plane  and 
electronically  processed  to  form  the  neuron  values  in  the  hidden  layer  L].  These  values  are  then 
written  to  the  Lj  sector  in  the  R  ponion  of  the  optical  input  plane.  Another  optical  matrix-vector 
product  is  formed,  detected  at  L2  in  the  O  plane,  and  processed  to  form  the  output  neuron  values  in 
L2. 
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Fig.  36.  SPONN  implementation  of  multi-layered  neural  networks  by  spatial  organization  of 
optical  input  plane. 
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If  more  complicated  networks  with  additional  features  such  as  backward  connections  or 
addinonal  layers  are  required,  they  can  be  implemented  by  simply  dividing  the  optical  input  plane 
into  more  sectors  and  changing  the  software.  It  is  not  necessary  to  adjust  the  hardware  or  add 
more  crystals.  This  easy  reconfigurability  is  due  to  the  capabilit;f  of  arranging  neurons  in  arbitral^' 
patterns  in  the  optical  input  plane,  a  feature  made  possible  by  thej elimination  of  Bragg  degeneracy. 

6.3  PACKAGING  | 

The  prototype  research  model  of  SPONN  described  in  tljp  previous  section  was  designed 
for  maximum  flexibility  using  off-the-shelf  components,  including  a  bulky  water-cooled  argon  ion 
laser.  No  effort  was  made  to  package  the  system  in  a  compact  viilume  as  the  emphasis  of  this 
program  was  the  demonstration  of  working  optical  neural  networks.  However,  we  did  investigate 
packaging  concepts  for  future  versions  of  SPONN  which  would  use  custom  components  and 
compact  solid  state  laser  sources  which  should  become  available  at  reasonable  cost  in  the  next  few 
years.  In  this  section  we  describe  two  concepts,  one  using  currently  available  components  and  the 
other  using  components  which  should  become  available  within  the  next  five  years. 

Our  near  term  packaging  concept  is  shown  in  Fig.  37.  The  overall  size  of  this  package  is 
15”  by  20"  plus  10"  by  15"  for  the  laser  power  supply,  or  about  the  size  of  a  desktop  PC.  It  is 
based  on  a  diode-pumped  YAG  solid  state  laser  available  from  Adlas,  Inc.  The  laser  is  frequency 
doubled  using  a  nonlinear  crystal  to  emit  140  mw  at  532  nm.  This  wavelength  is  close  to  optimum 
for  typical  undoped  BaTi03  holograms.  Although  greater  optical  power  would  be  desirable,  it  is 
enough  for  a  demonstration  system.  We  also  assume  a  3"  LCTV  as  the  SLM.  The  relatively  large 
size  of  the  LCTV  sets  the  dimensions  of  the  optical  train.  Note  the  use  of  a  solid  optics  assembly 
constructed  from  prisms  and  employing  total  internal  reflection  to  form  the  R  and  O  beams.  By 
attaching  the  photorefractive  crystal  to  the  optics  assembly,  the  paths  of  the  two  beams  are 
completely  contained  within  a  single  glass  or  plastic  unit.  This  greatly  reduces  vibration  sensitivity 
and  simplifies  optical  alignment.  Solid  optics  techniques  are  used  in  laser  gyroscopes  for 
navigation  and  guidance  of  commercial  and  military  aircraft  and  missiles.  Such  gyroscopes,  which 
are  also  interferometric  devices,  are  used  successfully  in  high  vibration,  grueling  environments. 

As  part  of  our  packaging  investigation,  we  constructed  a  Michaelson  interferometer  mounted  in  an 
aluminum  box  with  a  laser  diode  light  source.  We  found  that  the  interference  pattern  was 
unaffected  by  casual  handling  of  the  package  or  by  dropping  heavy  objects  next  to  it.  This 
increased  our  confidence  that  an  inexpensive  vibration-isolated  package  for  SPONN  can  be 
constructed. 

By  replacing  the  YAG  laser  with  a  laser  diode  and  using  a  1"  SLM,  the  package  size  can  be 
dramatically  reduced  to  less  than  5.5"  by  10",  as  shown  in  Fig.  38.  Thanks  to  market  demand 
from  other  fields,  CCD  cameras  and  laser  diodes  are  now  available  which  would  fit  into  such  a 
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Fig.  37.  (a)  Design  layout  for  SPONN  optical  neural  network  using  presently  available  green 
solid  state  laser,  (b)  SPONN  solid  optics  assembly  for  beam  direction  and  improved 
vibration  resistance. 

volume.  Components  which  should  become  available  in  the  next  few  years  are  the  1 "  by  1 " 
electrically-addressed  SLM  and  doped  BaTi03  which  is  sufficiently  sensidve  at  the  laser  diode 
wavelength.  One  possibility  for  the  SLM  is  the  Hughes  CCD-addressed  LCLV.26  A  number  of 
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other  companies  and  universities  are  also  working  on  electrically-addressed  SLMs.  An  alternative 
to  doped  BaTi03  is  to  replace  it  with  another  photorefractive  material  which  is  sensitive  in  the 
infrared,  such  as  CdTe,  GaAs,  or  multiple  quantum  well  structures. 


Fig.  38.  Design  layout  for  SPONN  optical  neural  network  using  laser  diode  and  1-inch  SLM. 


SECTION  7 

IMPLEMENTATION  OF  NEURAL 
NETWORK  MODELS 


7.1  PERCEPTRON 

The  Perception  was  one  of  the  first  neural  networks  to  be  invented.  In  its  most  commonly 
implemented  form  it  consists  of  a  single  layer  of  weights  connecting  a  field  of  input  neurons  with 
a  single  output  neuron.  (Some  of  the  original  Perception  networks  contained  a  "preprocessing” 
layer  with  fixed  weights  which  were  not  adjusted  during  learning.)  A  single  output  neuron  can 
dichotomize  or  separate  the  vector  space  of  input  patterns  into  two  classes.  The  weight  values 
together  with  the  threshold  value  of  the  output  neuron  determine  a  separating  hyperplane  for  pattern 
vectors.  Therefore  it  can  only  distinguish  between  classes  which  are  linearly  separable.  The 
Perception  is  appealing  as  a  first  test  f  neural  hardware  because  of  its  simplicity  and  the  fact  that  a 
learning  rule  with  guaranteed  convergence  is  known  (provided  a  solution  can  be  represented  in  the 
first  place).  Its  main  weakness  is  that  as  a  single-layer  network  it  is  limited  to  linearly-separable 
solutions  and  therefore  cannot  solve  many  problems  of  practical  interest. 

ITie  output  of  a  simple  Perception  is  given  by 


y  =  h\ 


WjX:  -  6 
i  J  J 


K  ) 


where  w  is  the  weight  vector  connecting  the  input  neurons  x  with  the  output  neuron,  0  is  the 
output  neuron  threshold  value,  and  h(z)  is  a  hard-threshold  response  function  with  outputs  1  or  - 1 
for  z>0  and  z<=0,  respectively.  Input  patterns  are  binary  and  normally  assume  values  of  1  or  0.  6 
can  be  learned  by  setting  one  of  the  input  neurons  to  1,  although  in  our  experiments  we  set  6=0. 
The  learning  rule  is  simple  and  can  be  expressed  in  terms  of  the  weight  upxlate  vector  as 


where  is  the  desired  output  for  exemplar  c.  If  the  output  is  correct,  no  change  is  made.  If  y=l 
but  D(.=-l,  a  change  proportional  to  the  negative  of  the  exemplar  is  made.  Finally,  if  y=-l  but 
Dc=l,  a  change  proportional  to  the  exemplar  is  made.  A  slight  modification  of  this  algorithm  was 
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made  in  the  optical  implementation.  A  "forbidden  zone"  centered  on  zero  was  defined  for  neural 
outputs  before  thresholding:  if  the  value  of  LwjXj-6  was  within  this  zone  then  a  correction  to  the 

weights  was  made  even  if  the  thresholded  network  output  was  correct.  This  made  the  system  more 
robust  by  penalizing  small  weight  values. 

A  diagram  of  the  SPONN  optical  input  plane  for  implementation  of  a  Perceptron  network  is 
shown  in  Fig.  39.  The  R  plane  contains  the  pattern  in  the  input  neuron  layer  Lq  and  the  O  plane 
contains  the  output  neuron  layer  Lj.  In  our  experiments  we  have  used  both  a  single  output  neuron 
and  many  output  neurons.  The  latter  case  is  essentially  many  Perceptions  operating  in  parallel  on 
the  same  input  patterns  but  with  different  classification  goals.  In  all  of  the  experimental  examples 
described  here  the  problem  to  be  solved  was  the  transformation  of  a  set  of  random  binary 
exemplars  (2-D  random  patterns  with  pixel  values  of  1  and  0)  into  another  set  of  random  binary 
patterns. 
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Fig.  39.  Optical  input  plane  for  single  layer  Perceptron. 

The  first  Perceptron  experiment  had  a  single  output  neuron,  the  goal  was  to  separate  or 
dichotomize  a  set  of  random  patterns  into  two  classes.  The  results  of  this  experiment  is  shown  in 
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TOTAL  ERROR 


Fig.  40  which  is  a  plot  of  total  error  during  learning  veicus  ei.xx:h  number.  In  this  case  the  optical 
neural  network  learned  to  dichotomize  96  random  patterns  after  29  epochs.  Each  pattern  consisted 
of  1920  pixels  (60x32).  Increasing  the  number  of  pixels  tended  to  reduce  the  number  of  panems 
that  could  be  learned  although  with  7680  pixels  (120x64)  the  system  could  still  learn  42  patterns. 
In  order  to  test  the  noise  level  in  the  system,  we  attempted  to  dichotomize  two  patterns.  The 
patteins  were  identical  except  for  a  prescribed  number  of  differing  pixels.  We  reduced  the  number 
of  differing  pixels  until  the  optical  neural  network  could  no  longer  distinguish  them.  As  shown  in 
Fig.  41,  the  system  could  separate  patterns  containing  as  few  as  0.5%  differing  pixels. 


1  10  19  28  37  46  55  64  73  82  91 
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Fig.  41.  Optical  Perceptron  separation  of  two  nearly  identical  patterns  which  have  1920 
pixels  and  diRer  by  0.5%. 
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We  were  also  able  to  implement  multiple-output-neuron  Perceptron  networks.  The 
networks  learned  to  perform  a  one-to-one  transformation  of  a  given  set  of  random  binary  patterns 
(values  1  and  0)  into  another  set  of  random  patterns  (values  1  and  -1).  The  results  of  one 
experiment  are  shown  in  Fig.  42  in  which  the  input  and  output  layers  contained  1740  and  870 
neurons,  respectively.  This  network  had  a  total  of  1.5  million  weights.  We  then  scaled  this 
network  up  to  10,260  neurons  and  2x10^  weights.  The  leamiug  curve  for  this  larger  network  is 
shown  in  Fig.  43.  The  processing  rate  was  2xl07  connections  updated  per  second  during 
learning.  This  rate  was  limited  by  the  PC  host  bus  since  we  had  to  transfer  neuron  values  back  and 
forth  between  the  image  processor  cards  and  host  memory.  In  the  next  phase  of  this  project  we 
will  install  an  accelerator  card  with  a  local  bus  connection  to  the  image  processor  cards  which 
should  greatly  increase  the  processing  rate. 
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fig.  42.  Multi-output-neuron  single-layer  optical  Perceptron  learning  of  4  exemplars.  The 
network  consisted  of  1740  input  nc  irons  and  870  output  neurons. 
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Fig.  43.  Multi-output-neuron  single-layer  optical  Perceptron  learning  of  4  exemplars.  The 
network  consisted  of  6840  input  neurons,  3420  output  neurons,  and  2x10^ 
connection  weights. 

We  also  used  this  type  of  network  in  a  handwritten  digit  recognition  application. 

Exemplars  were  extracted  from  a  database  of  handwritten  digits  supplied  by  the  U.S.  Post  Office. 
Examples  of  the  digits  are  shown  in  Fig.  44.  The  network  was  trained  to  label  inputs  as  one  of  the 
ten  digits.  The  total  error  during  learning  of  80  exemplars  is  also  shown  in  Fig.  44.  We  did  not 
expect  good  performance  because  this  network  contains  a  single  layer  of  weights  and  the  character 
recognition  problem  is  not  linearly  separable.  The  network  achieved  an  8%  error  rate  on  the 
training  set  and  a  25%  error  rate  on  40  test  digits  it  had  not  seen  before.  Although  this  was  not 
good  enough  for  practical  use,  it  did  demonstrate  generalization  and  learning  since  the  expected 
untrained  error  rate  is  90%. 
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(vs.  untrained  rate  of  90%) 


Fig.  44.  Handwritten  digit  recognition  using  optical  Perceptron  network. 


7.2  BIDIRECTIONAL  ASSOCIATIVE  MEMORY 


The  Bidirectional  Associative  Memory  (BAM)  neural  network  is  a  type  of  hetero- 
associative  memory  with  some  generalization  or  learning  properties.  Heteroassociative  means  that 
when  a  partial  or  distoned  version  of  a  stored  pattern  is  input  to  the  network,  the  stable  output  is  a 
complete  undistorted  version  of  a  stored  output  pattern  that  is  "associated"  or  originally  recorded 
with  the  input  jsattem.  The  BAM  is  illustrated  in  Fig.  45.  It  consists  of  two  neuron  layers  and 
Fg.  The  neuron  activation  functions  are  hard  thresholds.  Patterns  activating  layer  Fy^  are 
thresholded,  weighted,  and  transmitted  to  Fg  (bottom  up).  The  resultant  patterns  are  in  turn 
thresholded  by  Fg  and  transmitted  back  down  to  F^  via  the  same  set  of  weights  (top  down).  The 
cycle  then  repeats.  Kosko^^  has  shown  that  the  function 

£(a,)3)  =  el)p 

always  decreases  as  the  system  evolves.  In  the  above  expression  (a,P)  are  column  vectors 
representing  the  patterns  in  (F;^,Fg)  and  (O^.^b)  ^  corresponding  threshold  levels.  Since  E 
is  bounded  from  below  and  it  always  decreases,  it  is  an  "energy"  or  Lyapunov  function  that  is 
minimized  as  the  system  evolves  in  time.  The  minima  of  E  correspond  to  stable  limit  points.  The 
only  condition  on  the  connection  matrix  for  this  to  be  true  is  that  the  weights  in  the  bottom  up 
direction  are  the  same  as  in  the  top  down  direction.  This  corresponds  to  having  the  same  weight 
connecting  neurons  i  and  j  in  both  directions  (e.g.,  hij=hjj  or  H=H’^).  Kosko  showed  that  the  limit 
points  correspond  to  stored  associated  pattern  pairs  a*’^  and  b*^  if  the  connection  weights  are  given 
by  a  sum  of  outer  products: 

M 

>H,  = 

m=l 

Kosko  also  showed  that  BAM  is  capable  of  limited  learning  or  generalization  from  examples. 

We  implemented  the  BAM  neural  network  on  SPONN  by  modifying  the  Perceptron 
software  to  divide  each  of  the  R  and  O  planes  into  two  sections.  R  was  divided  into  subsections 
Rj  and  R2  and  O  into  Oj  and  O2.  During  the  readout  phase  layer  F;^  for  the  bottom  up  direction 
was  represented  in  Rj  and  Fg  for  the  top  down  direction  was  in  R2.  In  order  to  always  use  the 
fanned  R  beam  to  read  out  the  hologram,  separate  physical  connections  were  used  for  the  bottom 
up  and  top  down  weights,  although  they  were  forced  to  be  equal  in  value  as  r^uired  by  the  BAM 
model.  During  the  writing  or  weight  adjustment  phase,  Fg  was  represented  in  Oj  and  F/^  in  O2. 
Thus  the  bottom  up  F;^->Fg  weights  were  adjusted  by  turning  on  Rj  and  Oj  while  the  top  down 
Fb->Fa  weights  were  exposed  by  R2  and  02-  The  two  sets  of  weights  were  exposed 
simultaneously  to  ensure  their  equality. 
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Fig.  45.  Demonstration  of  optical  BAMM  neural  network  (hetero-associative  memory). 


Typical  experimental  results  are  shown  in  Fig.  45  which  illustrates  the  recall  of  two 
associated  pattern  pairs.  Addressing  the  system  with  a  noisy  version  of  stored  panem  a  resulted  in 
a  stable  state  which  cycled  between  a  and  its  associated  pattern 

7.3  BACKPROPAGATION 

Although  Perceptron  and  BAM  networks  are  interesting  and  useful  as  tests  of  neural 
hardware,  their  single-layer  nature  limits  their  usefulness  to  separable  problems.  Adding  a  layer 
greatly  increases  the  power  of  a  single-layer  network.  In  fact,  it  has  been  shown  that  a  single- 
hidden-layer  network  (input,  hidden,  and  output  neuron  layers  with  two  layers  of  weights)  can 
approximate  any  function  to  arbitrary  accuracy  provided  a  sufficient  (but  finite)  number  of  neurons 
is  available.^  One  of  the  most  popular  multi-layer  learning  algorithms  in  terms  of  applications  is 
backpropagation.29,30  jn  this  section  we  describe  our  optical  implementation  of  the 
backpropagation  algorithm. 

Backpropagation  is  based  on  steepest  descent  of  an  error  surface  defined  by 

c  i 


where  E  is  the  total  error,  Dj  ^  is  the  desired  value  for  output  neuron  i  given  exemplar  input  c,  and 
y(f')i  c  is  the  actual  output  for  exemplar  c.  The  weight  adjustment  rule  during  learning  is  based  on 


the  steepest  descent  rule: 


A  (n) 

A»v>7  =  ~77 — r-\ 


where  the  superscript  (n)  refers  to  the  weight  layer  being  updated.  Assuming  a  two-layer  network, 
and  without  going  into  the  details  of  the  derivation  here,  the  error  gradient  is  given  by  the 
following  set  of  equations  for  the  output  layer  (see  ref.  22  for  details): 


*(>)=/[/''W]=>’(i-y) 


and  by  these  equations  for  the  hidden  layer: 
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where  f(x)=l/(l+e*’^)  is  the  neuron  sigmoidal  response  function. 

One  possible  issue  in  the  optical  implementation  of  backpropagation  is  its  sensitivity  to  the 
accuracy  of  representation  of  the  functions  f()  and  g().  Due  to  inherent  nonlinearities  and 
nonuniformity  in  present  SLMs,  f()  and  g()  may  differ  from  the  above  form.  We  have  performed 
computer  simulations  of  backpropagation  with  slightly  different  f()s  and  g()s.  We  found  that 
although  the  network  is  robust  with  regard  to  changes  in  f(),  performance  degrades  if  g()  departs 
from  its  proper  dependence  on  f().  Therefore,  in  our  optical  implementations  we  measured  the 
SLM  nonlinearity  and  electronically  compensated  for  it  so  that  g()  could  be  implemented 
accurately. 

In  the  optical  system  we  actually  implemented  a  variation  of  backpropagation  in  which  the 
input  error  signals  were  trinary  quantized  to  +1, 0,  and  -1  according  to  the  algorithm  reported  by 
Shoemaker  et  al..^*  They  found  that  trinary  quantization  improved  the  convergence  speed  of 
backpropagation  for  a  wide  variety  of  problems.  Our  own  computer  simulations  confirmed  this. 
However,  our  primary  reason  for  trinary  quantization  was  to  avoid  amplitude-dependent  phase 
errors  in  the  LCLV. 

The  optical  input  plane  for  backpropagation  is  shown  in  Fig.  46.  The  network  consisted  of 
three  neuron  layers  (Lq,  Lj,  and  and  two  weight  layers.  Note  that  the  L1-L2  weights  are 
actually  implemented  as  two  separate  sets  of  weights,  one  for  the  forward  pass  Lj  -  >  L2  and 
another  one  for  the  backward  pass  L2  ->  Lj.  As  explained  previously,  this  is  done  because  the 
fanned  reference  beam  (R  plane)  is  always  used  to  read  out  the  unfanned  object  beam  (O  plane).  In 
order  to  implement  forward  and  backward  passes  through  the  same  set  of  effective  weights,  two 
sets  of  photorefractive  weights  must  be  exposed.  They  are  exposed  so  as  to  make  the  forward  and 
backward  weights  as  nearly  equal  as  possible  (symmetric  connections,  wij=wji)  although  they  can 
also  be  made  different  (asymmetric  connections)  if  required  for  certain  networks.  Although  the  Lj 
->  1/2  and  L2  sections  of  the  input  plane  are  shown  spatially  separated  for  clarity,  in  actuality 
they  are  spatially  interleaved  in  order  to  make  the  connections  as  symmetric  as  possible.  The  solid 
arrows  indicate  optical  connections  between  neurons  via  the  hologram.  Dashed  arrows  denote 
electronic  transfer  of  detected  outputs  from  one  layer  (O  plane)  to  the  inputs  of  the  next  layer  in  the 
R  plane.  This  electronic  operation  is  an  order  N  operation  while  the  optical  connection  is  order  N^, 
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where  N  is  the  number  of  neurons.  The  speedup  factor  over  a  single  electronic  processor  is 
therefore  proportional  to  N. 
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Fig.  46.  Optical  input  plane  for  optical  backpropagation  with  a  single  hidden  layer. 

A  flow  diagram  for  the  optical  backpropagation  algorithm  is  shown  in  Fig.  47.  The 
forward  and  backward  weights  between  layers  Lj  and  L2  were  adjusted  simultaneously  in  order  to 
keep  the  weights  as  symmetric  as  possible.  This  also  caused  self-connections  from  each  neuron  to 
itself  to  be  formed.  The  self-connections  do  not,  •'owever,  affect  the  operation  of  the 
backpropagation  network.  As  described  in  the  previous  section  on  the  bipolar  algorithm,  the  L]  to 
Lo  weights  are  exposed  twice  [(Lj  )(L2)  and  (-Lj)(-L2)]  in  order  to  ensure  that  wjj‘‘^=wij"=wjj+ 
and  wjj''“=wij"^=wij'. 

Experimental  results  for  the  problem  of  transforming  one  random  binary  pattern  into 
another  using  optical  backpropagation  are  shown  in  Fig.  48  where  we  plot  the  total  output  error 
versus  epoch  during  learning  of  two  pattern  transformations.  In  this  case  the  network  contained 
252  neurons  arranged  in  three  layers  as  128-62-62.  The  output  error  decreased  to  5%  after  30 
epochs  and  reached  zero  after  90  epochs.  In  order  to  track  the  evolution  of  the  output  pattern  for 
each  of  the  two  exemplar  inputs,  we  also  plotted  the  projection  of  the  output  pattern  vector  on  the 
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Fig.  48.  Optical  backpropagation  learning  of  two  pattern  transrormations  (see  text). 
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reference  vector  (1,1,1,. ..,1).  We  can  clearly  see  the  buildup  of  the  weights  from  small  initial 
values  to  saturation.  In  Fig.  49  we  plot  the  total  error  versus  epoch  after  learning  is  completed  and 
the  network  is  continuously  read  out.  Due  to  the  weight  decay  discussed  previously,  the  errors 
began  to  increase  dramatically  after  200  readouts  of  the  network.  The  weight  decay  is  clearly  seen 
in  the  simultaneous  plots  of  the  output  pattern  vector  projections.  Note  that  the  output  error  does 
not  increase  until  after  the  weights  have  decreased  by  a  large  amount,  indicating  the  network  has 
learned  a  "safety  margin"  which  makes  it  less  sensitive  to  weight  decay.  Analogous  results  are 
shown  in  Fig.  50  for  the  same  network  learning  four  pattern  transformations. 

Results  for  larger  backpropagation  networks  are  shown  in  Fig.  51,  one  with  320  neurons 
distributed  among  three  layers  as  160-80-80  (20,(XX)  weights)  and  another  with  1 140  neurons 
distributed  as  570-285-285  (244,000  weights).  The  first  network  converged  to  approximately  3% 
error  after  50  epochs  while  the  second  one  reached  6%  error  after  1(X)  epochs.  In  both  cases  the 
exemplar  set  consisted  of  two  patterns.  In  these  initial  backpropagation  results  the  performance 
degraded  for  eight  or  more  exemplars.  As  of  this  writing  we  are  still  investigating  why  this  is  so. 
Possible  reasons  include  residual  inaccuracy  in  representing  g()  and  excessive  laser  noise. 
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Fig.  49.  Continuous  readout  of  optical  backpropagation  network  after  lei^ming  has  been  completed. 
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Optical  backpropagation  learning  of  four  pattern  transformations  (see  text). 
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Fig.  51.  Optical  backpropagation  learning  of  two  pattern  transformations.  Results  for  two 
networks  are  shown,  one  with  320  neurons  and  the  other  with  1140  neurons. 
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SECTION  8 


PERMANENT  RECORDING  OF  WEIGHTS  IN 
PHOTOREFRACTIVE  CRYSTALS 

8.1  HOLOGRAM  FIXING  TECHNIQUES 

The  refractive  index  variadons  in  photorefractive  materials  that  arc  normally  used  for  rcal- 
dme  holography,  phase  conjugation  and  optical  signal  processing  are  produced  by  internal  electric 
field  variations  associated  with  charges  trapped  at  mid-gap  levels.  These  holograms  are  generally 
not  permanent  under  illumination  with  light  at  the  wavelength  used  to  write  them  because  this  light 
continues  the  writing  process.  Thus  illumination  with  an  interference  pattern  of  a  new  period  both 
erases  the  old  grating  and  writes  a  new  one,  similarly  illumination  with  uniform  light  erases  the  old 
grating.  In  order  to  obtain  a  fixed  hologram  in  a  photorefractive  material  the  internal  space-charge 
pattern  must  be  transferred  to  a  charged  defect  or  dopant  level  that  is  not  optically  active  at  the 
wavelength  of  interest.  Two  methods  that  can  be  used  to  do  this  in  situ  arc  thermal  and  electrical 
activation.  In  this  section  we  review  the  method  we  pursued  to  obtain  fixed  holograms. 

From  a  systems  point  of  view,  the  most  attractive  hologram  fixing  technique  involved  the 
application  of  an  external  electric  field  in  order  to  transfer  a  normal  photorefractive  hologram  in  the 
photosensitive  deep  levels  to  a  space  charge  hologram  in  an  insensitive  or  non-sensitive  levels. 

The  generic  process  involves  the  steps  illustrated  in  Fig.  52.  Initially,  the  crystal  of  BaTi03  is 
polled  with  a  field  E| ,  shown  in  the  upper  plot.  At  time  ti-t2,  lower  plot,  a  hologram  is  written 
with  the  usual  photorefractive  process.  An  electrical  field  -E2  is  applied  at  t2.  A  mirror  image 
(180°  phase  shifted)  space-charge  pattern  that  nearly  cancels  the  photorefractive  hologram  is 
produced  in  charges  that  are  immobile  and  optically  insensitive  at  room  temperature.  At  time  t3,  the 
photorefractive  hologram  is  erased  leaving  only  the  minor  image  hologram  that  cannot  be  erased 
with  optical  radiation  alone.  At  time  t4  a  field  E3  is  used  to  reset  the  crystal  to  its  initial  condition, 
erasing  the  fixed  hologram.  Note:  in  some  cases  it  may  be  advantageous  to  perform  two  or  tliree 
of  these  processes  at  the  same  time. 

In  materials  with  rather  low  coercive  fields  (on  the  order  of  1  kV/cm)  such  as  BaTi03  and 
Srj.xBaxNb206  (SBN),  electric  fixing  with  domain  reversal  has  been  discussed  in  the  literature. 
The  coercive  field  is  the  critical  applied  electrical  field  required  to  reverse  the  spontaneous 
polarization  of  a  ferroelectric  crystal.  It  has  been  suggested  that  by  applying  a  field  with  an 
amplitude  just  below  the  coercive  value  and  with  an  (mentation  opposed  to  the  orientation  of  the 
existing  polarization,  a  spatial  pattern  of  polarized  clusters  may  be  produced.  If  this  operation  is 


82 


C9127-07-007 


tl  t2  ta  t4  ts 


TIME— - 

Fig.  52.  Waveforms  used  in  hologram  Fixing  experiments. 

performed  after  the  recording  of  a  hologram,  the  cluster  panem  may  mirror  the  recorded  hologram. 
Holograms  fixed  using  domain  reversal  should  not  be  erased  optically,  but  application  of  a  strong 
poling  field  will  restore  the  initial  "blank"  state  in  the  crystal. 

Demonstration  of  this  electrical  fixing  technique  at  room  temperature  was  discussed  by 
Micheron  and  Bismuth  in  BaTi03.  Several  research  groups  have  tried  to  reproduce  the  data, 
however,  at  the  time  of  this  repon,  have  not  been  able  to  observe  the  published  results. 

In  order  to  explain  the  electrical  fixing  and  erasing  of  holograms  in  BaTi03  and  SBN, 
Micheron  and  coworkers  assumed  that  with  an  applied  field  E2  of  reversed  polarity,  ions  or 
vacancies  drift  under  the  influence  of  the  internal  field  Ej  caused  by  the  photoelectrons  trapped 
during  the  writing^step  and  clusters  or  domains  of  reversed  pol?rity  are  formed.  The  field  E2, 
however,  is  not  large  enough  to  excite  photoelectrons.  With  E2  applied,  fast  ionic  displacements 
occur  that  cancel  the  spatial  modulation  of  Ep  The  diffraction  efficiency  drops  to  zero.  Under 
uniform  illumination,  photoelectrons  are  again  photoexcited  and  diffuse  uniformly,  thus  leaving 
behind  an  uncompensated  ionic  pattern  and  revealing  the  original  hologram.  This  pattern  is  stable 
during  readout  and  constitutes  the  fixed  hologram.  Erasure  is  achieved  when  the  applied  field  is 
high  enough  to  restore  unifram  ionic  distributions.  This  explanation  has  been  based  on  the 
hypothesis  that  domain  panems  replicating  the  wrinen  hologram  could  be  induced  in  both  BaTi03 


and  SBN  by  application  of  the  field  £2-  This  hypothesis  accounts  for  the  erasure  process.  Their 
theory  was  partially  supported  by  two  experimental  observations.  First,  photoinduced  clusters 
could  be  written  at  temperatures  above  the  Curie  temperature  in  SBN  and  fixed  by  cooling  the 
crystal  (Micheron  and  Trotier  1974),  thus  establishing  a  link  between  the  occurrence  of  frozen 
clusters  and  antiparallel  domains.  Moreover,  holographic  fixing  in  SBN  is  only  observed  when 
the  voltage  E2d  (where  d  is  the  electrode  spacing)  is  larger  than  the  coercive  voltage  Vc  =  775  V. 
This  is  again  consistent  with  the  picture  of  domain  reversal.  It  remains,  however,  to  be  explained 
why  the  same  authors  did  observe  domain  reversal  in  BaTi03  below  the  coercive  voltage. 

8.2  MEASUREMENTS  OF  BATIO3  ELECTRICAL  PROPERTIES 

In  this  section  we  present  an  overview  of  some  of  the  experiments  we  conducted  with  the 
goal  of  electrically  fixing  photorefiractive  holograms.  In  order  to  characterize  the  state  of  our 
crystals  prior  to  attempting  to  fix  the  holograms,  we  made  measurements  of  the  electrical 
properties,  including  a  determination  of  the  coercive  field  for  each  sample. 

BaTi03  and  SBN  are  ferroelectric  crystals  because  they  have  two  orientational  states  of 
spontaneous  polarization  in  the  absence  of  an  electric  field  and  can  be  shifted  from  one  to  another 
of  these  states  by  applying  an  electric  field.  An  as  grown  crystal  can  have  different  orientations  of 
these  domains  throughout  the  crystal.  By  applying  first  a  mechanical  and  then  a  sufficiently  large 
electric  field  with  the  correct  orientation,  all  of  the  domains  can  be  made  to  align  with  one  another. 
In  our  holographic  measurements,  we  initially  start  with  a  single  domain  crystal  and  write  a  grating 
by  interfering  two  optical  beams.  We  then  applied  various  electrical  waveforms  to  the  crystal  in  an 
attempt  to  create  a  pattern  of  domains  which  mimic  the  distribution  of  the  light  intensity. 

The  most  important  characteristic  of  a  ferroelectric  crystal  is  that  the  relationship  between  P 
and  E  is  represented  by  a  hysteresis  loop.  Ferroelectric  hysteresis  loops  can  be  observed  with  the 
circuit  shown  in  Fig.  53,  which  is  patterned  after  one  first  described  by  Sawyer  and  Tower.  An 
a.c.  field  is  generated  by  a  frequency  synthesizer  or  D/A  board  in  a  computer,  and  amplified  by  a 
high  voltage  amplifier.  The  crystal  is  placed  in  series  with  a  capacitor  Cj  which  h?  -  a  capacitance 
significantly  larger  than  that  of  the  crystal.  The  applied  field  is  monitored  with  a  nigh  voltage 
probe.  The  voltage  drop  across  the  capacitor  Cj  is  proportional  to  the  charge  which  accumulates 
on  the  crystal. 

An  ideal  plot  of  the  charge  or  polarization  of  the  crystal  versus  the  applied  field  is  shown  in 
Fig.  54.  This  plot  clearly  displays  a  hysteresis  loop.  The  polarization  initially  increase  as  a 
function  of  increasing  applied  filed  and  reaches  a  saturated  value,  essentially  independent  of  the 
field.  When  the  field  is  decreased,  the  value  of  the  polarization  does  not  re-traverses  the  initial 
curve.  Some  of  the  domains  remain  aligned  in  the  positive  direction  and  the  value  for  zero  field  is 
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Fig.  53.  Circuit  for  observing  hysteresis  loop. 
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called  the  remnant  polarization,  P^.  The  value  of  the  spontaneous  polarization,  P^,  can  be 
determined  by  extrapolating  of  the  linear  upper  portion  of  the  curve  back  to  the  polarization  axis. 

In  order  to  remove  the  overall  polarization  of  the  crystal  it  is  necessary  to  apply  an  electric 
field  in  the  opposite  or  negative  direction.  From  this  graph,  one  can  measure  the  coercive  field. 
Eg,  the  field  required  to  reduce  P  to  zero.  Further  increase  of  the  negative  field  eventually  causes 

almost  all  of  the  dipoles  to  align  in  the  direction  opposite  to  that  when  the  saturated  positive  field 
was  applied. 

Exporimentally  obtained  hysteresis  loops  for  one  sample  of  BaTi03  are  shown  in  Fig.  55. 
The  graph  is  qualitatively  representative  of  the  different  crystal  samples,  and  shows  two  individual 
hysteresis  loops  which  essentially  overlap  each  other.  These  loops  were  qualitatively  similar  to 
graphs  obtained  for  samples  of  SBN.  We  obtained  close  to  two  thousand  hysteresis  curves  from 
one  of  our  samples  of  BaTi03.  The  initial  curves,  yielded  a  value  of  1.3  KVolts/cm  for  the 
coercive  field  and  2.5x10*5  couVem^  for  the  spontaneous  polarization. 


However,  we  found  that  after  cycling  the  same  crystal  through  several  thousand  hysteresis 
loops,  the  crystal  could  not  attain  the  same  degree  of  polarizability  as  when  it  was  first  grown  and 
thermally  polled.  This  implied  that  some  of  the  domains  were  being  pinned  in  one  orientation. 
Also,  after  taking  the  hysteresis  measurements  we  found  that  the  crystal  faces  to  which  the 
electrodes  were  attached  had  been  damaged.  The  crystal  positive  c-face  was  crazed  and  this  could 
have  impacted  the  magnitude  and  spatial  uniformity  of  the  applied  field. 

8.3  EXPERIMENTS  IN  ELECTRICAL  FIXING  OF  BATIO3 

In  this  section  we  present  a  review  of  the  experiments  we  conducted  to  deteimine  the  effect 
of  applying  electrical  fields  on  photorefractive  crystals  after  a  photorefractive  grating  was  wrinen  in 
the  crystal.  These  series  of  experiments  were  designed  to  fix  the  photorefractive  holograms.  A 
summary  of  the  parameter  space  we  investigated  is  shown  in  tabular  form  in  Table  1.  Our  group 
and  others,  in  particular  at  Stanford  University  and  the  University  of  Southern  California,  have  not 
been  able  to  reproduce  the  room  temperature  electrical  fixing  results  published  by  Micheron  and 
Bismuth. 

The  state  of  the  photorefractive  hologram  can  be  determined  by  monitoring  the  two  writing 
beams  and  the  diffraction  efficiency  of  a  probe  beam.  Typically,  the  probe  beam  is  incident  on  the 
crystal  at  the  Bragg  angle,  and  two  beams  emerge  from  the  other  side  of  the  crystal:  the  diffracted 
and  undiffracted  beams.  The  absolute  diffraction  efficiency  of  a  grating,  Tiabs»  the  ratio  of  the 
diffracted  intensity  to  the  incident  intensity.  The  relative  diffraction  efficiency,  Tjrgi,  is  the  ratio  of 

the  diffracted  intensity  to  the  sum  of  the  diffracted  intensity  plus  the  undiffracted  intensity.  One 
can  convert  between  the  two  efficiencies  with  the  following  formula:  "nabs^rel^^'*^’  where  t  is 
the  interface  transmission  and  a  is  the  absorption  coefficient. 

The  optical  train  that  we  used  in  our  experiments  is  shown  schematically  in  Figure  56.  We 
constructed  our  optical  train  in  a  manner  which  provided  a  high  degree  of  mechanical  and  thermal 
stability.  The  output  of  the  argon  ion  laser  passed  through  a  half-wave  plate/polarizer  combination 
as  a  means  for  controlling  the  overall  beam  intensity.  The  beam  was  spatially  filtered  and 
collimated  in  order  to  produce  a  uniform  intensity  pattern  at  the  crystal.  A  portion  of  the  beam 
transmitted  through  the  first  beam  splitter  (BSl)  was  used  to  monitor  the  total  power  in  the  pump 
and  probe  beams.  The  reflected  beam  was  used  to  generate  the  two  writing  beams  using  a  second 
beam  spliner  (BS2).  The  two  beams,  designated  as  "A"  the  pump  and  "B",  the  probe,  were 
aligned  to  interfere  in  the  crystal.  The  transmitted  pump  and  probe  beams  were  monitored  by  the 
detectors  labeled  "A "  and  "B",  respectively,  in  the  figure.  The  detectors  were  used  to  monitor  the 
transfer  of  energy,  or  two-beam  coupling  gain,  while  the  gratings  were  written  in  order  to 
determine  the  photorefractive  grating  strength  and  stability.  These  detectors  had  interference  filters 
(514  nm)  in  front  of  them  to  improve  the  signal  to  noise  ratio.  The  probe  and  pump  beams  were 
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TABLE  I.  Status  of  Electrical  Fixing  of  Photorefractive  Holograms  in  BaTiO 
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Fig.  56.  Apparatus  used  in  hologram  fixing  experiments. 

comparable  in  both  transverse  dimension  and  flux.  An  electronic  shutter  in  the  beam  path  was 
used  to  control  the  crystal  exposure  time. 

Mirror  "A",  located  after  the  spatial  filter,  could  be  moved  into  the  beam  path  to  generate  an 
erase  beam.  This  beam  could  be  used  to  uniformly  illuminate  the  crystal  at  an  angle  other  than  the 
Bragg  matched  condition  for  erasing  the  photoreffactive  gratings.  Note:  if  one  of  the  writing 
beams  were  used  to  erase  the  grating,  new  gratings  could  be  written  because  the  diffracted  signal 
would  interfere  with  the  original  incident  "erase"  beam.  A  second  electronic  shutter  was  used  to 
control  the  illumination  time  of  the  crystal  to  the  erase  beam.  In  our  system  the  crystal  could  also 
be  illuminated  with  an  auxiliary  incoherent  source  to  erase  tlie  photoreffactive  gratings.  The 
advantage  of  using  the  incoherent  source  is  that  it  produced  a  spectrum  of  wavelengths  and  cannot 
produce  any  interference  effects. 

A  HeNe  laser  operating  at  0.6328  pm  was  used  to  generate  a  probe  beam  which  allowed  us 
to  monitor  the  evolution  of  the  gratings  in  the  crystal  during  the  entire  writing  process.  Note:  the 
longer  0.6328  pm  wavelength  in  conjunction  with  a  significantly  reduced  flux  (~5  pW)  insured 
that  there  was  minimal  erasure  produced  by  this  monitoring  technique.  In  order  to  simplify  the 
Bragg  matching  condition,  the  HeNc  laser  was  mounted  on  a  rotatable  platform  which  pivoted 
about  the  center  of  the  BaTi03  crystal.  The  crystal  was  held  in  place  by  a  custom  support  on  the 
end  of  a  post  holder  which  was  bolted  to  the  optical  table.  A  bearing  assembly  around  the  post 
holder  allowed  the  HeNe  laser  platform  to  freely  rotate  parallel  to  the  optical  table  centered  about 
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the  crystal.  The  incident  HeNe  beam  was  monitored  by  detector  "C"  and  the  diffracted  signal  was 
detected  by  detector  "E"  in  the  figure.  Both  detectors  had  514  nm  interference  filters  to  improve 
the  signal  to  noise  ratio. 

The  relative  diffraction  efficiency  was  determined  by  measuring  the  diffracted  signal  and 
dividing  it  by  the  sum  of  the  transmitted  zero  order  and  the  diffracted  signal.  In  order  to  conven 
these  numbers  to  absolute  diffraction  efficiency  absorption  would  have  to  be  taken  into  account.  In 
addition,  the  diffraction  efficiency  at  0.5145  |im  is  higher  than  at  0.6328  ^m  by  approximately  a 
factor  of  3/2  due  to  the  X'2  dependency.  During  the  experiment,  we  used  an  A/D  board  to  acquire 
up  to  eight  channels  of  data  simultaneously.  The  board  had  12  bit  resolution  and  could  operate  at 
up  to  250  KHz  on  a  single  channel. 

The  data  we  obtained  from  one  typical  experimental  run  is  shown  in  Fig.  57.  Figure  57(a) 
is  the  diffracted  probe  beam  as  a  function  of  time.  Figure  57(b)  is  the  transmitted  pump  beam 
while  Fig.  57c  is  the  transmitted  probe  beam.  Finally,  Fig.  57(d)  is  the  applied  electrical 
waveform  versus  time.  The  run  starts  by  allowing  the  two  write  beams  to  fall  on  an  erased  crystal 
at  time  t=l.  1  seconds.  A  transfer  of  energy  is  seen  to  occur  between  the  two  writing  beams.  As 
the  transmitted  probe  beam.  Fig.  57(c),  grows  in  amplitude,  the  pump  beam  is  depleted.  The 
signal  proportional  to  the  diffraction  efficiency  of  the  probe  beam.  Fig.  57(a),  grows  with  time  to  a 
peak  value.  At  a  time  41.6  seconds  into  the  run,  a  square  electrical  pulse  of  one  second  duration 
was  applied  with  the  waveform  shown  in  Fig.  57(d). 

At  that  point  the  photorefractive  gratings  are  disrupted  and  the  two  wave  mixing  gain  is 
impacted.  Therefore,  the  transmined  probe  and  pump  intensities  approach  the  initial  condition 
before  the  gratings  were  established.  However,  because  the  two  writing  beams  continue  to 
illuminate  the  crystal,  they  begin  to  rewrite  gratings  which  again  causes  an  energy  transfer. 

The  diffracted  HeNe  signal  monitoring  the  photorefractive  hologram  decreased  in  amplitude 
at  the  occurrence  of  the  leading  edge  of  the  electrical  pulse.  Then,  because  the  writing  beams  were 
still  on,  the  diffracted  signal  grows  slightly.  At  time  43. 1  seconds  into  the  run,  the  pump  and 
probe  beams  are  blocked  and  the  diffracted  signal  remains  essentially  constant  in  time.  The  crystal 
was  then  illuminated  with  an  incoherent  light  at  a  time  equal  to  49.6  seconds.  The  apparent  slight 
rise  in  the  amplitude  of  the  diffracted  signal  is  due  to  a  small  percentage  of  scattered  erase  light 
entering  detector  "E".  This  noise  occurred  even  though  we  used  a  an  aperture  and  interference 
filter  in  front  of  the  detector  to  minimize  the  effect. 

Using  a  generalized  curve  fitting  routine,  we  determined  the  functional  form  of  the 
diffracted  signal  in  the  presence  of  the  erase  beam.  The  diffracted  signal  decayed  with  a  single 
exponential  time  constant  to  the  offset  determined  by  the  noise  floor  produced  by  the  erase  light. 
The  erase  light  was  turned  off  at  time  75.9  seconds.  As  can  be  seen,  no  residual  fixed  grating  was 
present  in  the  crystal  which  would  have  diffracted  the  HeNe  monitor  beam.  After  the  grating  was 
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completely  erased,  the  crystal  was  "reset"  close  to  its  initial  state  before  trying  another  run.  This 
was  accomplished  by  re-polling,  i.e.,  cycling  through  several  hysteresis  loops  or  applying  half 
sine  waves  with  sufficient  amplitude  to  realign  the  domains. 

To-date  we  have  not  been  successful  in  electrically  fixing  gratings  in  BaTi03  crystals. 
However,  on  another  contract  we  were  successful  in  thermally  fixing  gratings  in  photoreffactive 
LiNb03  crystals.  We  achieved  fixed  diffraction  efficiencies  of  up  to  52%.  Thermal  fixing  may 
thus  be  a  reasonable  alternative  to  electrical  fixing  of  BaTi03. 

An  alternative  to  fixing  weights  is  to  continuously  refresh  the  weights  after  the  neural 
network  has  learned  the  solution  of  a  pardcular  problem.  This  can  be  done  by  interleaving 
occasional  learning  or  refresh  cycles  into  the  readout  phase.  Once  the  network  has  learned  a 
solution,  only  occasional  "reminders"  are  necessary  to  keep  the  network  from  'forgetting"  the 
solution.  This  technique  is  analogous  to  refresh  cycles  in  conventional  DRAM. 
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SECTION  9 
CONCLUSIONS 


During  the  course  of  this  three  year  DARPA  contract  we  demonstrated  the  application  of 
cascaded  grating  holography  (CGH)  in  optical  neural  networks.  By  distributing  each  connection 
weight  among  a  plethora  of  cascaded  gratings  which  vary  in  position  and  orientation,  CGH 
improves  holographic  optical  neural  networks  in  several  ways.  First,  Bragg  degeneracy  is 
eliminated  which  permits  the  placement  of  pixels  in  arbitrary  2-D  patterns  in  the  input  and  output 
planes,  leading  to  new  flexibility  in  holographic  recording.  Second,  CGH  reduces  distonions  due 
to  beam  coupling,  resulting  in  undistorted  weight  vectors  and  good  quality  reconstructed  images. 
The  flexibility  of  CGH  allows  the  design  of  compact  programmable  holographic  neurocomputers 
which  are  composed  of  a  single  photorefiractive  crystal,  SLM,  and  detector.  The  utilization  of  the 
SLM  is  maximized  because  every  pixel  can  be  used  without  fear  of  crosstalk. 

Under  this  contract  we  constructed  an  optical  neurocomputer  and  showed  its  flexibility  by 
demonstrating  several  neural  networks  without  adjustment  of  the  optical  components.  The 
networks  were  Perception,  bidirectional  associative  memory,  and  backpropagation  with  a  single 
hidden  layer.  We  also  demonstrated  CGH  in  the  infrared  using  compact  laser  diode  sources  and 
designed  packaging  concepts  based  on  laser  diodes. 

In  short,  we  have  demonstrated  the  viability  and  potential  of  CGH-based  optical  neural 
networks.  Future  work  should  include  refinement  of  current  algorithms  and  implementation  of 
new  ones,  improvement  of  storage  capacity  and  processing  speed,  demonstration  of  compact 
packages,  and  application  of  this  technology  to  a  real-world  problem. 
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APPENDIX  A 


OPTICAL  BACKPROPAGATION  PROGRAM 

SOURCE  CODE 


r  Ieam34n.c  Optical  Bacfcpropagalion,  November,  1992. 

This  C  language  program  implemants  a  three-layer,  single  hiddert-layer, 

bipolar  neuron  backpropagation  neural  network  in 

torward-scanaring  SPONN  which  maps  random  input  pattams 

to  random  output  patterns.  In  addition  to  the  host  IBMoompatUe  PC,  k  also 

controls  the  FG100  input  image  processor,  the  VS100  ou^HJt  image  processor, 

and  the  LCoetls  which  shuaar  the  R  and  O  beams.  Rough  output  coordirtates 

are  read  from  aligr(34n.dat  which  is  generated  using  a  separate  program,  alignvs6.c 

Rnal  ooordinatss  are  stored  in  algn34n.dat.  Local  search  and  sorting 

are  used  to  find  the  brightest  output  pixels.  Neuron  outputs  are  represented 

using  a  balanced  dual-rail  method.  Lt  and  L2  fields  are  spatialy 

rrterteaved  artd  up  and  down  weights  are  exposed  simultaneously 

to  toroe  equatty  befiween  the  two  sets  of  weights.  Neuron  pixel  size  is  adkistable 

Trirtary  quantization  of  weight  updates  is  an  option.  Thresholded  output  error 

measures  correctness  of  output  sign  only.  The  FG100  input  image  processor  must  be 

initialized  at  8  MHz  to  work  property. 

This  program  was  written  by  Yuri  Otrechko,  Hughes  Aircraft  Company,  301 1  Malibu 
Canyon  Rd  ,  Malibu,  CA  90265.  E-mail:  owechko@csfvax.dnet.hac  com 
310317-5839 

£)  Copyright  1992,  Hughes  Aircraft  Co. 
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ainclude  <graph.h> 

#indude  <ctype.h> 
tinclude  <sldio.h> 

Ihnclude  <oonio.h> 
tirxdude  <itex100.h> 
fvKfude  <math.h> 

•include  <prooess.h> 

•include  <time.h> 

•include  <floath> 

•include  <dos.h> 

•include  <search.h> 

•include  <sldlib.h> 

/*  First  bit  controls  laser  and  detector  LC  oeUs. 
second  bit  connote  obiect  LC  cell,  third  bit  controls 
detector  shutter  7 

•define  WRITE  3 
•define  REA04 
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fdefine  STANDBY  0 

Adeline  SPONNJNPUT  O)(34O.0)(d0000L,2.1 
edeline  SPONN_OUTPUT0x360.0)(d0000L2.1 

int  brandO,getbits(): 

void  oonf_semples().readout_L10.readout_L2(); 

void  change_sign_L1(),change_sign_L2(): 

void  exe(nplar_randoni_L0(),disp_oell().a‘anster_L1(); 

void  train_W1  (),train_W2(),backwaid(): 

int  round().oon(ipareO,hsgn(),satur(): 

float  error_leam().error_read(),fsigmo(),ghump(): 

float  comparB_sign(),avg_rpixel(): 

FILE  *topen().*slream: 

intxoul(32I64I16].youl(32|64I16|,xt(100],yl(1001: 

int  xin_ot^[32],yin_ofaj(64],mxniax,mynnax,nimax,seed.seed2: 

int  xin_relI32],yin_ref{64],npixels.8ize; 

int  anaydim.xmax.xmaxi  ,ymax.ymax1  .yrnax2.arraydim2.base1  ,base2; 

nt  pxout(32|64],pyoul(32](64J; 

long  int  on_tinie.off_time; 

float  sumi  [32I20).sum2(32]I20]: 

float  yy1[16I201.yy2(16I201.delta1[16K20|.dalta2(16I20]: 

float  lotofTt500J,pfoLw(500][21.prpLrt5001[2).totefr_read(500J,lgamma; 

struct  doedatej  dale: 
struct  dostimej  timel : 
stoict  dostimej  time2; 

main() 

I 

int  n.m.mx.my.i.j.k.nepoch,nepoch2.ntolal: 

mt  err_value,nframes.nxavg2,nyavg2.pixval; 

int  gain.olfsetval,xba5e,ybase^oomval.v1(2S6]; 

int  It  .i2.nx.ny.nxavg,nyavg,value.shift.deoquanLdedut.declut2.decAlign: 

long  int  delay3,int_time.lval,klong; 

long  Int  Hime.prevtime: 

float  flime.looptime,gain2.otttime.offtime.erTor.f  1 .12; 

float  inttime,inttime2.fval,eps1  .ep52; 

char  commenl(80].oomment2(80|.ooniment3(80I.sienip(200I.blk.ch: 

b8se1-592;  r  5e2»2S0h  PP1 1  b«e  address  V 
base2-596:  r  S9&<2S4h  PPI 2  base  addtess  V 
blk>219;  /*  Solid  block  ASCII  character  */ 

/*  Set  mode  of  DI048  VO  card  for  Mode  0,  al  outputs  */ 

autp(baBe1-»3.128): 

outp<baae2-f3.128), 

it  ((straam>fopen('leam34n.inp'.*r'))»NULL) 

{printfrcannol  open  leam34n.inpVi*); 

exit(0);| 

else 
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pnntf(*leam34n.inp  parameter  file  opened  tor  readingVi'); 


fgels(oomfnent2.80.streafn): 
tgetstoominent, 80, stream) ; 
tgets(comment3,80.strBam) : 

printfCEnler  number  of  exemptais  in  x  and  yVHmxinax<«16  and  mymax<=15  for  64x64  display  fields);  ‘): 

tgets<siemp,2005lream); 

tscanl(stBam,*%d  %dfo',&mxmax.&mymax): 

pnntfC^  %dfo',mxmax,mymax); 

mmax-mxmax*mymax: 

pnntf(*No.  of  exemplars^  %d\n*,mmax): 

printf('EnlBr  zoom  value  for  display  field  size  (3->64x60, 2->128x120, 1->256x240, 0->512x480)  *); 

fgetsf  stamp, 2003lFeam); 

lscanf(stream,*%d\n',&zoomval): 

printf(*%<^n',zoomval) : 

arraydim>pow(2,9-zoomval) ; 

afTaydim2«aneydim/2 ; 

pnntffOisplay  field  dimension  is  %d\n‘,arraydim); 

pnntff'Eniar  xmax  arfo  ymaxtfor  64x64  arrays,  xmax^  &  ymax=S7):  *); 

fgeis(siBmp,200.siream) ; 

fscanf(sliBam,*%d  %din‘,&xmax,Aymax): 

printffSd  %(fn*,xmax,ymax); 

xmax1>xnnax/2; 

ymaxUymax/3; 

yrrtax2>2*yrT«ax/3; 


pnntf(*EniBr  subneuron  edge  dimension; '): 
lgets(slBmp,200,sieam) : 
tscanf(a^wlam,*%t^n•,6size): 
pnntt(*%(fn‘,size); 

prwTtf(*Enler  1  to  realign,  0  ofoenwise  '), 
lgets<stBmp,200,s»eam); 
lscanf(steam,'Kdfo',&d9cAlign): 
pnnirr%cAn‘,dacAlign) ; 

printff'EnlBr  no.  of  epochs  up  to  500;  ’): 
fgets(stBmp,2003team) ; 
lscar4(stream,'%(An‘,&nepoch), 
pfintl('%<^n',nepoch); 
nepach2>nopoeh;  . 

pnntfrEniBr  exampiar  miegration  time  in  sec; '), 
fgetB(stBfnp,200,steam): 
tecanl(steam.'%hn',&innime). 
pnntl{T(lif\n",inllme): 

phntff’EntarOufout  Frame  Grabber  gain  from  1  lo4;'); 

fgets(stamp,200«team) , 

fscanf(slnam,*%hn*,6gain2); 

ptinlfr%hn*,gain2); 

gain»255*gain2r4; 
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pnntf(*EniBr  Output  Frame  Grabber  otlset  trom  0  to  255; '); 
fgetB(siBmp.200«tBam) : 

(5canl(stBam,*K(An*,aottsetval); 
prmtf('%dVt'.off8ehraI) : 

piintfCOulputFreme  Grabber  input  LUT:0  tor  unmodHied.  1  iorsqrtVt*); 
tgels<siBmp,200a*Bam); 

(scanf(slream,'%(An*,&dBdut): 
pnntf(‘%(An',deciut) : 

printfCIttput^rante  Grabber  output  LUT:0  tor  unmodHwd,  1  tor  linearized  outpubn*); 

fgets(slBmp.200abeam); 

fscanf(strBam,'%din*.&dBdut2): 

pnntf(''%(to'.d8dut2) : 

pnntff  Enter  iaigmo  gamma  value:  *); 
fgets<si8mp, 200, stream) . 
lscanl(stream.'%N)*.&fgamma) : 
pnntfC^fin'.lgamma): 

printf(*EnlBr  1  tor  binary  quantizalian  o<  weight  updates.  0  otherwiseVt*): 
fgets(stBmp,200abeam) ; 
lscanf(saeam.‘%(^'.&daoquant); 
prmtf  ('%^*,daoquant) : 

prinll(*EtaBrepa1  toryl  quantization  and  epe2  tor  delta  1  anddelta2quantizaiionV«'); 
tgets(stBmp.200atream) . 
iscant(stiaam.'%f  %hn'.&eps1  .&ep52]; 
pnnttC^f  %rbt‘.ep6l  .eps2): 

pnntfl'Enter  SLM  turrvon  and  turn-off  times  in  sec; '); 
fgels(stemp,200,sbeam) ; 
lscar«t(straam.*%t  %hn',&onlime.&offtime): 
printff%f  %hn'.onime,offlime): 

pmtfCEntar  x  and  y  dimensiorB  of  averaging  regions  (odd  numbers<>9); '): 

fgets(stBmp.2004Veam); 

fscanl(sbaam.*%d  %dv)',anxavg.&nyavg); 

pnntfC^d  %(fn'.nxavg,nyavg); 

nxavg2>(nxavg- 1  y2: 

nyavg2>(nyavg-1)/2, 

pmtfCEntar  no.  of  samples  per  subneuron  (<>16):  '); 
lgets(siBmp.200at8am) . 

Iscanl(steam.''%(fn*.  Anpurels) : 
prinlfC%(^‘.npmsls): 

printICEntar  no.  of  video  tames  to  average  (<-100):  ■): 
fgets(slBmp.200ataam); 
fscant(s>eam,‘%<fn*.antrames); 
primf(*%(to',ntoBmae): 

prinit(*EniBr  seed  imager  tor  random  number  generator:  *); 


97 


(gei5(siB(np,200,snam) . 
iscant(8tiBam  ,''%d\n'.&seed) : 
pnntt(*%(An‘.saed); 
seed2>SMd; 

fciose<slreafn); 

/*  Turn  oft  Input  Frame  Grabber  */ 

8ethdw(SPONNJNPUT); 

seldim(1024.1024.12); 

^0Off(); 

r  Set  up  Output  Frame  Grabber  */ 

tethdw(SPONN_OUTPUT]: 

initializaO; 

exi>ync(); 

seicamera(0): 

stBtic_luti(): 

setBain(gain): 

se(oftMt(offaetwal); 

r  MorfV  INPUT  LUT  of  Output  Frame  Grabber  V 
if(daclut){ 

lor(j»0;j<»255j+-»)  { 
v1[j]«round(aqrt(25S.)*sqr1(j)):  ) 
waluKINPUT.0.0.2S6.v1): 
seihJtflNPUT.O); ) 

r  Set  oiAputLUTs  of  Output  Frame  Grabber  for  display 
if(declut)( 

ter(i^j<»255;j*+M 
v1[j]-raund(pow0.2)/2S5.):  ) 
walut(nED.0.0.2S6.v1): 
v1iO)-2S6; 
v1(2S6H>: 

«raiut(BLUE.0.0.2S6.v1): 

v1((4>0: 

«raMGREEN,0.0.2S6.v1 ); 
se«ut(GREEN.O);L 

for(jaO;(<>2S5j-»-»){ 

vi(iH;  1 

«valut(RE0.0.0.2S6.v1). 

vip}.2S6: 

vt(2S6)-<>; 

iMa(ul(8LUE.0.0^.v1); 

«»alul(QREEN.0.0.256.v1); 

mMQREEN.O);| 


grab<0). 


/*  Set  up  Input  Frame  Grabber '/ 

swilcb(dedut2) 

{ 

caseO: 

for(j>0J<>255;j4-»)| 

viDH;} 

break; 
case  1: 

if((stream>fopen('lmear.lut*.*r’))MNULL) 
{pnntf(‘cannot  open  knear.lutVi'): 
e»KO);} 


pnnt((*\nlinear.lut  file  opened  lor  raadingNn’); 
lor(j-0:i<>25S  )-«-»){ 

Iscanifstream  .*%din',a  V 1  (tl) :} 
fdoeefsiraam); 
break; 

1 


•goffO; 

sethdMr(SPONN_INPUT); 

Igonf): 

Mralut(GREEN,0,0,2S6,v1),  r  Adjust  Input  Frame  Grabber  output  LUT  */ 
walut(RE0.0.0.2S6.v1); 

V1(2S4H>; 

vt(255)>0; 

walut(BLUE.0.0.2S6.v1); 

aclear(0.0.1023.1023.0); 

zoomfzoomval); 

disp.oefKO.O); 

/*  Cakbrats  time  base  V 


timefaitime); 

prevtime>itime, 

k)r00ongl4;kiong<2000000;klong■^■^); 

timo<&ltime); 

ftimo-llime-prevtime; 

looptima«liima/i!000000.0; 

int_time-inttimeHooptime; 

on.tene-onimefloaplime; 

ofl_time»offtimaflooptime; 


/*  Ganerale  SPONN  input  ooordinalB  vactars  */ 


lor(i-0;i<xmax;i-»4^H 

mn_rafri)-arraydim24*siza;) 

tortiaO;i<ymax;i-f'»)| 
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yin_ref(IJ*i"sae,) 


tor(i«0;i<xmax  )( 

xin_ob|i]>i*sizeJ 

ior(i>0:i<yiTiax:i'f-f){ 

r  Store  zero  pellam  in  (0.0);  re(-on  in  (1.0);  object  subeets  A  and  B 
in  (3.0)  and  (4.0).  lespedivety.  */ 

xbase>arrayditn. 

ybase-0; 

tor(j>0;j<yfnax;j-f-f){ 

for(i-0;i<xmax;i-«-f){ 

bk)ck(xbaa»fan_refri].ybaBe-^yin_fe((naiza.aize.2S5); 

I) 

xbase«3*arraydifn; 

ybase-0; 

for(j»0;j<yfnax  ;!+♦)( 
shi(ti>ceil(fmod(j,2.0)) , 
for(i>0;i<xmax;i*«2){ 

btock(xhaie-»xin_obj(i->8hi1^.ybase-fyin_obi(|).si2e.si2e.2SS); 

)1 

xbase>4*arraydiin ; 
ybaaeaO; 

k>r(jaOj<yfnax;j-^-f){ 
shift- 1  -ceil(ftnod(j.2.0)) ; 
tor(i-0;Kxtnax;i«-2)| 

**>‘*(****®+wn_obj|i+8hiftl.ybase+yin_obUi).8ize.8ize.255); 

1) 


r  CakxjMa  ouftxjt  oooidinMH  and  fine-iuna  by  searching 
locai  neighborhoods.  ’/ 

iftdecAlign) 

{ 

i(  ((s»eam-topen("^!ign34n.dar,V))— NUU) 
|piinll(lncanno(  open  align34n.datVi*). 
exit(0):} 

alse 

pnnif('V<align34n.dBt  coortlnaie  Me  opened  kx  rsatingy)') 

tor(i-0;i<ymax  ;)♦«){ 
lar(i-0;i<xmax;H«){ 

lMani(sMam.*%d  %d  ‘.Axout(i]0]IO).ayou4gi|OI)  j 
l8can1(slroarn.Vt');) 

>ciose(siream): 
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dBp_oell(3.0):  r  Turn  on  all  obiect  ‘A'  pixels  */ 


»0O«(): 

selhdie(SPONN_OUTPUT); 

•gonO; 

printfOnAdiuBt  bhghmess  then  hit  any  key  to  save  output  cooidinalaeto*); 

outp(base1.REAO); 

nxavg2><nxavg-1V2: 

nyavg2x<nyavg-1V2; 

tor(my>0:fny<ymax:my-f-f)| 

for(mx>0;mx<xmax.nrix-»'4){ 

pxou1(mx]{myl>xoiri(mxXniy){0]: 

pyoul|mxHiny)-youl(fnxHmyI01:}) 

ter<;;M 

for(myaO:my<ymax;my-t'-f){ 

shift>oeil(finod(my.2.0)): 

tor(mxaO:mx<xmax;mx-f>2){ 

ractartgle<pxoul(8hift'fmxImy)-nxavg24)yout(shift-i'mxImy}-nyavg2,nxavg,nyavg,2S5): 

iectangle(pxout(shM+(nxHmy}-nxavg2.pyouXshift-»mx][my}-nyavg2.nxavg,nyavg,255):}) 

snap(WAIT): 

if(kbhit())tora8k: 

) 


ch>gelch();  /*  Empties  key  butter  */ 

pnnttCVtConCguring  output  pixel  ooontinatas,  please  wait  .\n'); 

<gott(), 

seihdw(SPONN_INPUT). 

IgonO. 

disp_oell(0,0); 

•gottO; 

8ethdw(SPOJN_OUTPUT) ; 

IgonO; 

lortmy>0:my<ynMu;my'f'»)( 
shittaoeil(fmod(my.2.0)): 
lor(mx>>0;mx<xmax  ;mx  ♦>2){ 
oonl_sampt8s(shitt-»mx,my,nxavg2,nyavg2)J} 

IgotIO, 

solht*e(SPONN_INPUT);  r  Twn  on  an  obiect  "B"  pixels  */ 
IgonO, 

diap_oall(4,0): 

lvala3*on_lime: 

lor(ktong>0Mon9<lval;klong 


igollO; 

settidw(SPONN_OUTPUT) : 
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»gon(). 


snap<WAIT): 

sethdw(SPONNJNPUT): 

•gooO; 

disp.celKO.O): 

*0O«(); 

selhdi«(SPONN_OUTPUT); 

«gon(); 

tor(my»0;my<ymax;my++)| 

Shifts  1 -oe«l(finod(my. 2.0)); 
for(mxs0;mx<xmax:inx-t's2){ 
conf_safnples<shift-ffnx,my,nxavg2.nyavg2)J} 

for(my>0;my<yinax:my-f->)| 
tor(m  xsO  ;mx<xtnax  ;mx  ■!■♦ ){ 
ior(ia0:i<npixel8;i4^-i'){ 

biocfc(xoul(mx)(my](j].youlffnx]{my](i],si2e.size.25S);))j 

phntf(^nOuft>ut  pixels  marked  in  red  on  monift>r\n’): 

IvalaS.Oteoplitne; 
for(UongsO;klong<lval:ktong-f -t-) ; 

r  Write  output  ooorcfnaies  to  Me  algn34n.dat  V 

if  ((stream«fopanCalgn34n.dar.*w*))ssNUU.) 
fpnntfOncennot  open  align34n.da(Vr'). 
exiftO):} 
else 

printfOnaign34n.dat  ooordinalB  file  opened  tor  wiitIngVn') ; 

for(mysO:my  <ymax  :my-t-+)| 
fortmxsO:mx<xmax;mx«-f){ 
lorfisO:i<npixels;i-f>){ 

lprinlf(snam.*%d  %d  '.xout(mx](my](i|,yout(mx]|my]{i|);} 
fpriniftstream.'Vt*):)) 

IdoeefsiiMm); 

9»«b<0), 


if  ((sireem»topenCelBf>34n.der.V))MNUU.) 
{printfOncannal  open  algn34n.iMVi*); 
exiftO);) 

elee 

piinlfOnelBr«34n.dBt  oooRfnale  Me  opened  lor  readeigVi’). 

for(mysO;my<ymax;my<f«){ 
fortmxsO;mx<xinax  ;mx-0'»){ 
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tor(isO:i<npixels  .i+'f ){ 

tscanf(stream.*%d  %d  ‘,&xout(mx](myIi],&youl(mx|my](j]):} 
tscanf(slream  .'\n*):}) 

tck>se<strean)); 

<go«(); 

seihd««(SPONN_OUTPUT) ; 

•gonO; 

gratKO): 


/*  Store  LO exetnplars  in  (0,1)  to  (mxmax-l.mymax)  V 
•goffO: 

sethdw(SPONNJNPUT): 

•gon(); 

for(mxaO;mx<inxnnax:mx>-t'){ 

tor(my»0:my<fnymax:my-f+)( 

exernplar_randoni_LO<mx,niy);)) 

r  Initialize  weights  using  sum  ol  random  outer-preducts  */ 

phntf(Vilnitializing  weighs  wHh  sum  of  rarKtom  outer-producS.  \n\nHit  any  key  to  begin  leaming\n'); 
Ivala4*int_time: 

outpfbasel  .WRITE); 

r  (LOKLI  )  Weight  initializaiion  vectors  in  (S.O).  V 

ybase>0: 

n»5; 

xbase>n*arraydim; 

srandfseed^-IOOOO+n): 

for(my>0;my<ymax1  :my>-f)( 
for(mx>0:mx<xmax;mx-fs2){ 
if(bfand()-«-1M 

block(xin_r8f(mx|-»xbase,yin_refimy)+yt>ase.size.size.255): 
block(xin_rellmx-f  1  ]+xbase,yin_ief(myj*ybasesize.size.O);) 
else| 

block(xin_ralfrnx)>xbase,yin_relImy]-»ybase,size,size,0); 
blockfxin.reqmx-)'  1  ]>  xbase.yin_rel(my)-fybaseaize.stze,255);| 

H 

tor(my>0:my<ymax1  ;my++)( 
lor(mx>:0:mx<xmax:mx+<2)| 
if(brand()— 1)1 

block(xin_obj(mx]-»xbOBe,yin_obi(my}+ytMse,size,size.2S6); 
blocfc(xin_obj(mx-»1  )■»  xbase,yin_ot)j(my]-»ybaae  jiza.size.O);) 

etae{ 

blecM)dn_obi(mx|->xtieae.yin_obi(my)-»ybase,size,sge,0); 

blocli(xin_otii^x-»1)>xtMse.yin_ob|my]»'ytMM,size.s<zs.25S);| 
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)) 


r  (LI  )(L2)  vectors  lor  weight  irtitalization  in  (7,0).  */ 

ybase^: 

n=7; 

xbase>n*arraydi(T), 
srand(seed-t-200004n) : 

tor(my-0;my<yinax1  ;myf -f)! 

Ior(nui>0;mx<xmax;mx>a2){ 
if(brand()— 1){ 

blod((xin_rel(mxKxba6e,yin_ral[2*fiiy-fyinax1}4'yfaase,size,size.255): 

block(xin_relImx-f1]-t'Xba6e,yin_rel|2*iny'fyTnax1]+ybase,size,size.O); 

blocfc(xin_obi{mx]'fxbase,yin_ob|2*iny+ymaxU1]-fyt>ase,size3ize,255); 

block(xin_ot)j(mx't-1]-fxbase.yin_obi(2*iny+ymaxU1|-»^y))ase.size,size.O): 

1 

else( 

block(xin_ref(mx]-fxbase,yin_rBfI2*my+ymax1]-fybase,size.size,0); 
block(xin_ref|mx-f  1  )4'XtMBe.yin_ie([2*my-t-ymax  1  ]+ybasa,size.size.255); 
blocfc(xin_ot)imx]-fxbase,yin_obj(2‘my'fymax1+t]4.yt)8se,size.size,0); 
blod((xin_obi{mx-f1]+xbase.yin_obj(2*my-t'ymax1+1]+ybase.size,size.255); 
1 

I) 

Ior(my«0:my<ymax1  ;my++){ 
lor(mxa0:mx<xiTtax;mx>s2){ 
if(brarKl()<i«-1)( 

biock(xin_re((mx)-»xbese.yin_rel(2*my'fymaxl4'1|4'ybase.size, size, 255); 
bk>ck(xir\_rsllmx>  1  )+xbase,yin_ref(2*my-«'yinax1  *  1  |+ybase,size,slze,0) : 
biock(xin_ot)jmx|+xbe8e,yin_obj(2*my-»yinax1]>ybase,s(ze.size.255); 
bloch(xin_ob|nix+1J-»xbaso,yin_obj(2*my+ymax1)+yt)ase^ize,size,0); 

I 

else{ 

block(xin_rBl(mx]-»xt>ase,yin_rell2*my+ymax  1 + 1  )•^ybase,size.size,0); 
black(xin_rBf  mx-f- 1  l+xbeBe,yin_rell2*my+yinaxl +1  l+ybase.size,size,255); 
block(xin_obi(nnx|-«'Xbaae,yin_obj[2*my -t-ymax  1  ]+ybase,size,size,0); 
block(xin_ot)j{mX'^  1  j-f  xfaeBe,yin_obj{2*niy  ^ymax  1  ]■^yba5e,5ize.size,255); 

I 

» 


diBp_oel(5,0): 

lor(ldang«0;tdong<>^alMong-»-f); 

diBp_oal<7,0): 

lor(klong-0:ldong<«int_timeMx)g4'«) ; 

seed^-f; 

il(Kfahit())bfeak:} 

ditp_oel(0.0): 

outp(ba8e1,F£AO); 
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ch=getch(): 


r . . 

r  LaarrangLoop  */ 


_dos jetlime(4limei ); 

1 1  =2.(V(mmax’xmax‘ymax  1 ); 

tor(n=0;n<nepoch;n-t'-t')( 
error  =0; 

tof(my»0:my<mymax;my++)| 
for(mx>0:mx<mxmax:mx4^-f){ 
disp_oell(mx,my+ 1 );  /*  Input  LO  exemplar  (mx.my)  V 
raadout_L  1  (nhames ) . 
transter_L1(); 
readout_L2(n<tames): 
err_value=efTor Jeam(  mx .  my  ,n) : 
en’or=eiTor+err_value, 
backwarcKntrafTies) : 
lrain_VV2(int_lime,deoquanteps  1  .eps2) : 
lrain_W1(int_time.deoquant.eps2,mx,my);}) 

totert(n|=f1 ‘error; 

pnnifCVtljeaming  Epoch=  %3d  Errors  %5.3f  ‘.n.toleiTln]), 
value=*tatBnfn]'40: 
tor(i>0:i<value,4+)| 
prjntfC%c',blk);) 

it(kbhit())| 
ch*^lch(); 
nepoctvsn. 
break; ) 

I 

_dos_gettime(4time2) ; 


r  Readout  Loop  */ 

r . - . / 


pnntf("\nViReadout  pf  network;\n\n’); 

k>r(na07i<nepoch2;n-r+){ 

error-0, 

tor(  my -0  :my<tnymax  ;my -r-r ){ 
for(mx>0;mx<mxmax  ;mx-^-f ){ 
disp_cell<mx.my-f  1 ); 
raadout_L  1  (ntrames) ; 
lran8lBr_L1(): 
rBadoul_L2(n#anwB); 
err_walu8-afTor_tBad(mx.my  ,n) ; 
efror-atror4arr_value 
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)1 


tolen’_read(n]sf  1  ’error; 

printfOnReadout  Epochs  %3d  Errors  %5.3f  '.n,lolerr_r8ad(n]): 
valuesiotBrr_r8ad(n]*40: 
for(isO:i<value;i-f+){ 
printf(*%c*.blk);) 

if(kbhit()){ 

nepoch2sn; 

break;) 

1 

outp(base1  .STANDBY): 

disp_oelK1.0); 

tgo«(): 

sethdw(SPONN_OUTPUT): 

»gon(); 

grab<0); 

if((streamsfopen(‘1earn34n.csv*.*w*))==NULL) 

|pnntf('cannot  open  Ieam34n.csv\n‘): 

exit(O):) 

else 

prmtfCVileamSAn.csv  file  opened  for  writing\n*), 

fpnntf(stream,*learn34n.c  Two-Layer  Optical  Backprop.  with  Symmetric  Dual-Rail  Levels  (Random  Mapping  Problem)\n’); 
_dos jgeldaM&daie) : 

fpnntffstream.’Dale;  %d-%d-%d  ’.date.month.date.day.date.year); 

fprintffstream.’Crystal:  %s’,comment2), 

tpnntf(slroam.’%s’, comment): 

tpnntf(  stream  .■%s”,comment3) : 

tpnntt(stream, ‘Enter  number  of  exemplars  in  x  and  y:\n'): 

lprintf(sfream,‘%d,  %d\n",mxmax.myrrwx): 

fprintl(stream.‘zoomval=  %dVn‘.zoomval): 

fpnntffstream.'Display  field  dimension  is  \n%d\n‘.arraydim); 

fpnntf(stream,'xmax  and  ymax=  \n%d,  %d\n‘.xmax,ymax): 

fval=xmax*ymax1-t'2'xn)axl*ymax1 : 

fprintffstream  .‘No.  of  neurons=  Vi%e\n‘.fval): 

fvalsxmaxrymaxl ; 

fvais(xmax*ymax  1  )*(fval)-t'(fval)*(fval): 

fpnntf(stream,*No.  of  weightes  V)%e\n',fval); 

fpnntf(stream, 'Enter  subneuron  edge  dimension :\n%din'.size): 

fpnntffstream.'No.  of  epochs:  *): 

fpnnlf(stiBam.‘%<An*,nepoch): 

Ipnntffstraam, ‘Exemplar  integration  time  in  sec:  ‘): 
tpnntl(siream,‘%Nr',inttirr»): 
fprinif(stream.'Ou^t  Frame  Grabber  gain: '); 
fprintf(stream,‘%hn‘,gain2). 

4>nrrtl(steam,‘Output  Frame  Grabber  offset  ‘): 
fp(intf(8taam,‘%dVi‘.oHietval); 

tprintf(strBam,‘Oulput  Frame  Grabber  input  LUT:  1  tor  square  root;  0  for  Nnearin'): 
fp«intf(straam,'%<to*.deciut): 
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tpnntt(siream.‘lnput  Frame  Grabber  oulput  LUT ;  0  tor  unmodified:  1  tor  lineanzed  SLM  intensity\n‘): 

tpnntf(stiBam.'%d\n‘.declut2): 

lpnntf(siream.‘EniBr  isigmo  gamma  value\n'): 

tpnntf(siream.*%An'.fgamma); 

fprintf(stream .‘Enter  1  for  irinary  quantization  of  weight  updates;  0  olheiwise\n‘); 
fprintf(stream,‘%d\n‘,deoquant); 

fprmtHsveam, ‘Enter  epst  tor  y  t  quantization  and  ep62  tor  deltal  artd  dena2  quantizationVi'); 
fprintt(stream.‘%f.  %fin‘,eps1,eps2); 

fpnntt(stiBam,‘LCLV  turn-on  and  turn-off  detays  in  sec>  \n%f,  %hn‘,ontme.ofttime): 

fphntf (stream .‘Enter  x  arxi  y  dimertsions  of  averaging  tegiorts  (odd  numbers)\n‘): 

fprint((stream.‘%d.  %(hn‘.nxavg,nyavg); 

fptintf(s«Bam.‘EnlBr  no.  of  samples  per  subneuron\n%dn‘.npixels); 

fprintffstream.'Enter  no.  of  video  frames  to  average  (<-100)\n‘); 

fprintf(strBam.‘%d\n‘.ntrames): 

fphntf(stiaam.‘Enler  starting  seed  for  random  number  generatorVi*); 
fprint((stieam.‘%dyi‘.seed2); 

fprtntffstraam .‘Learning  start  time;  %df)tel^  Step  time;  %d;%d;%^‘.time1  hour, 
ttmet  .mnute.timel  .seoorKf.time2.hour.time2.minute.time2.second): 

fpnntf(stream.‘\n\nLeaming  Error  and  2  Output  Projections  vs.  Epoch  Vr\n‘): 
for(n»0:n<nopoch;n+-f){ 

fpnntf(stroam.‘%d.  %f.  .%d.  %f.  %hn‘.n.toterT(nl.n.prDLw[nl(01.pfoLw{nI1]);l 

tprintf(saeam.'>n\nReading  Error  and  2  Output  Projectiorrs  vs.  Epoch\n\n‘): 
tor(n>0;n<nepoch2:ni-f){ 

fpnntf(stroam.‘%d.  %f.  ,%d.  %f.  %f\n‘.n.toterr_rBad(n].n.projLrInJ[0].proi_r{nll1]);) 
tclose(stream). 


"  Tbs  function  rounds  a  floating  point  number  to  the  nearest  integer  */ 

mt  round(fval) 
float  fval; 

I 

intval; 
float  remain: 

remain  sfval-floor( fval ) : 
if(rematn>s0.5) 
valsoe<(((val): 
else 

val^floorffval): 

returrKval). 

I 

/*  This  function  finds  the  sample  points  for  neuron  (mx.my)  ‘/ 

void  oonf_samplBS(mx.my.nxavg2.nyavg2) 
int  mx.my.nxavg2.nyavg2: 
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nt  count, xts,yts,ij.nurT)(100); 


x6>pxout(mx]imy|: 

yts-pyout(mx](iny|: 

r  Find  brighiast  pixels  by  sorting  V 

tor(i30:i<100:i-t-f)(nuni[i}>0;) 

oountsO: 

for(j>-nyavg2-f  1  j<«nyavg2;j++){ 
for(is-nxavg2>  1  ;i<=nxavg2:i>-»')( 
xt(oount]>xt5-fi; 
yt(oount]«yts-»j: 
num(ooun4>oount: 
oount-f -*-;}} 

qsort(num,  1 00, si2eof(int)  .compare) ; 

for(i=0:i<npixals;i-t-«'){ 
xout(mx|my](i]>xt(num(i]] : 
yout(mx](my][i]>yt(num(jl] ;) 


r  Descending  order  oomparison  function  for  qsort().  It  compares 
sampled  pixel  values.  V 

int  oompare(i.j) 
infi.*j; 

I 

mt  val.a.b. 

a-rptxel(xtl*jl.ytl*jj); 

b=rptxel(xt(*i),yt(*il), 

valaa-b: 

retumfval); 

I 

r  This  function  calculalB^LI  values  */ 

void  readout_L1(nframe6) 
intnframes: 

{ 

mt  i1  .i2.mx2.my2,val; 
kxigntklong: 
float  fval: 


•goffO: 

•elhdw(SPONN_OUTPUT); 

igon(); 
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for(my2^;iny2<yma)(1  :my2+-*'){ 
for(mx2>0;mx2<xmax;mx2+-f)| 
sum1|mx21nny2]>0:}} 

outp(t»se1.REAO): 

<or(klong«0:klong<on_tinie:klong-i-f): 

for(i2«0;i2<nfranies;i2>-f){ 

snap(WArn: 

fortmy2-0:iny2<ymax1  ;my2+>){ 
tor(mx2«0;mx2<xfTiax;mx2+-f)| 
Sufn1|inx2](my2]>suni1(mx2](my2]>avg_rpixel(mx2,my2);}} 

} 

outp(base1  .READ): 
fval^sl.CVnframes; 
tor(my2sO;my2<ymax1  ;my2-t-)-)| 
for(mx2>0;mx2<xinax;mx2-t-f)( 
sum  1  [mx2][my2)sfval*sum  1  [mx2]f  my2]:}} 

for(my2aO;my2<ymax1  :my2-t'+)| 
for(mx2>0:mx2<xmax1  ;mx2+-»^){ 

yy  1  |mx2](my2]«fsigmo<sum  1  {2‘mx2](my2)-6um  1  (2*mx2-»  1]{my2]);}) 

1 


r  Transfers  LI  output  to  LI  input  */ 

voidlransfer_L1() 

{ 

int  mx2.my2.pixval: 

•golfO; 

sethdw(SPONNJNPUT): 

»9on(); 

disp_oelK0.0). 

for(my2>0:my2<ymax1  ;my2++){ 
for(mx2«0;mx2<xmax1  ;mx24^+){ 
ptxval«127*yy1(mx2|my21: 

block(xin_relI2*mx2],yin_ref(2*my2+ymax1].si2e.sae,127+pixval), 

block(xin_ref|2*mx2-)-1].yin_rBf[2*my24^ymax1].size.size.127-p«xval). 

}> 

1 

r  This  function  cafcUaies  L2  values  */ 

void  readout_L2(ntrames) 
int  nframes: 

{ 

intil  j2.mx2.my2,val, 
long  int  klong: 
float  fval: 
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sethdw(SPONN_OUTPUT) 

•gonO: 

for(my2«0;my2<ymax1  ;my2-f-f)| 
for(mx2a0;mx2<xmax;mx2-t'4-){ 
suml(mx2]|my2]>0: 
sum2(mx21my2)>0:}) 

outp(base1.REAO): 

for(klong>0:klong<on_time;klong-«'-f); 

for(i2<0:i2<nfranies;i2  ■«’•)'){ 
snap(WArr); 

tortfny2>0;my2<yinax1  :my2-«-f){ 
tor(mx2-0;mx2<xmax:mx2-«"f){ 

Suin1(mx2][my2]«sufn1(mx2](my2]'favg_rpixel(mx2,ymaxl4-2*iny2):}} 

I 

change_sign_L1(); 

tor(klong«0:Mong<on_time;klong+-f); 


*goH(); 

seihdw(SPONN_OUTPUT): 

•gon(). 

for(i2«0;i2<nframes;i2-)-+){ 

snap(WArT); 

tor(my2«0;my2<ymax1  ;my2-i-')-){ 

Ior(mx2*0;mx2<xmax:mx2f-f)( 

suni2(mx2](my2]«sum2(mx2]{my2]-favsLrp<xel(mx2.ymaxU2‘my2);}) 

I 

outp(baie1.REAO); 

fvaI=0.S^irame6: 
tor(my2«0:my2<^  •'axl  ;my2++)| 
for(fnx2«0:rtix2<xh''ax:fnx2++)( 
sum  1  {mx2Kmy  2]>fval*sum  1  (mx2](my2]; 
sum2(mx2Hmy2)>ivarsum2(mx2)[my2]:}) 

for(my2«0;my2<ymax1  ;my2-f+)| 
fortmx2a0;mx2<xmax1  ;mx2-f-f){ 

yy2|mx2Hmy2)»fs»gnio(sum1(2*mx21(my2)-sum1(2*mx2+1I|my2J-sum212*mx21[my21+sum2(2*mx2+1)lmy21); 

II 

1 


r  This  routine  spaliaHy  averages  bright  pixels  */ 


float  avg_rpixeHmx,my) 
int  mx.my; 

I 

intj,i,val: 

•oat  Aral: 

vaM): 
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tor(jsO:)<npixels;|++)| 

val>val-»^rpixeKxoul(mx]|my]lj].yout(mx](myIj]):l 
tval =(ftoat^«l/npixel5 : 
retum(lval). 

1 


r  Calculates  L2  output  error  for  exemplar  (mx.my) 

In  learning  phase  */ 

float  error_leam<mx,my,n) 
int  mx.my.n; 

1 

mt  mx2,my2,iex,c; 
float  eZ.errxjr.lval: 

otTor»0; 

Csmy'mxmax-t-mx; 

srand<see<kc): 

fval>0: 

for(my2>0:my2<yniax1  :my2++){ 
for(mx2>0:mx2<xmax1  :mx2'f'f)| 
iex«txand(); 

e2>0  5*(iex-yy  2(mx2](my2]) : 
error<ierror-)«ompare_sign(iex  ,y  y  2|mx2](my  2]) ; ; 
delta2{mx21my  2|^2'ghump(yy2(mx2Kmy21) : 
Ivalafval-»^yy2(mx2|(my2];]] 

fvai>lval/(xmax  I'ymax  1 ) ; 

pnntff^Exemplar  (%d,%d)  errors  %f  projection^  %f’.mx.my,errDr.fval); 
if(c<2) 

proj_w[n][c)>fval; 
relum(  error), 


r  Calculates  L2  output  error  ter  exemplar  (mx.my)  in  reading  phase  V 

float  erTor_read(mx.my,n) 
int  mx.my.n; 

I 

Int  mx2,my2.iex,c; 
float  e2.erTor,teal: 

error-O; 

c=my*mxmax+mx; 

srandfseed-rc). 

lval>0: 

ter(my2a0:my2<ymax1  ;my2>-»){ 
ter(mx2>0:inx2<xmax1  :mx2'r-f){ 
iex«brand(); 
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e2>iex-yy2{mx2](my2] 

erTor>error-»co(npare_8ign(iex.yy2(mx2]|tTiy21): 

Ivalslval-»'yy2(nix2]{my2]:)) 

fvala  Val/(xmax  rymax  t ) ; 

pnntfCViExenipiar  (%d,%d)  error>  %( projection*  %f*,mx,fny,erTor.fyal): 
if(c<2) 

pcDL>1nIc)*hral: 

relumjenor); 


) 

r  Badtptopagale  t\e  error  signal  */ 

void  bacfcward(nfraines) 
xitnframes, 

{ 

longnikiong. 

int  mx2,my2.i2.pixval; 

float  el  .fval; 

«90«(); 

sethdrv(SPONNJNPUT); 

H>on(); 

/*  Erase  LI  input  */ 

block(arTaydim2,ymax  1  *size.xmax*sae.y  max2*size.O) ; 

/*  WrSe  dBltB2  to  L2  retorenoe  */ 

Ior(my2»0:my2<ymax1  ;my2++)| 
lor(mx2>0:mx2<xiT)axl  :mx24^+){ 
pixvaU  1 27’deita2(mx2](my2] . 

block(xin_ret|2*mx21,yin_t»f[2*my2+ymax  1 + 1  l.size.size.l  27+pixval) ; 
biocK(xin_fell2*mx2+ 1  l,yin_reH2*my2+yiTwx  1 + 1  l,si2e,size.  1 27-pixval) ; 
I) 

/*  Read  LI  output  and  calculate  dettat  */ 

KjoflO. 

8ethd*v(SPONN  OUTPLIT). 

<go«(). 

Ior(iny2>0:tny2<ymax1  :my2'»-^){ 

Iorjmx2«0-.mx2<xmax;mx2'«'*){ 

sum1(mx21(my2)*0: 

sum2(inx2Imy2]>0;|) 

oulp(baao1.REA0): 

lor(Wong*0:ldong<on_iime;Mong-»'f); 
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tor(  12=0  :i2<ntranies  :i2 -r + )t 
snap<WAtT): 

for(my2>0;my2<ymax1  ;my2-f+){ 
for(mx2=0;mx2<xfnax;mx2-f +)| 

sum  1  [mx2]lmy  2)«sum  1  lmx2]lmy2)4«vgL(P'x^('nx2.yinax  1  -i’  1  •('2'tny  2);]] 

) 

change_sion_L2(); 

tor(Mong4>:klong<an_time:klong4~f); 

«90«(): 

selhdw(SPONN_OUTPUT): 

»gon(); 

for(i2=0;i2<nfranies;i2-t^-f)| 

snapiWArT); 

tor(my2a0;my2<yinax1  ;my2-^+){ 
for(mx2>0;mx2<xmax;mx2't-'«'){ 

Sum2(mx21|my2)«sum2(mx2](my2]'t«/g_rpixel(mx2,ymax1+1+2*my2);)) 

( 

outp(base1  .READ); 
tvaUO.S/nlrames: 


tor(my2=0:my2<ymax1  ;my2++){ 
fof(mx2»0:mx2<xmax  :mx2++ ){ 
sum  1  |mx2](my2>B>val*sum  1  (mx2)|my2]: 
sum2(mx2]|my2|=tvarsum2(mx2Hmy21;)| 

tor(my2«0:my2<ymax1  ;my2++){ 
tor(mx2«0:mx2<xmax1  ;mx24'-f)( 

e  1  >0  00392*(8um  1  [2’mx2Imy2]-sum  1  [2*mx2+1  ](my2)-sum2(2*mx2](my2]^sum2(2*mx2-»  1  ][my2]) : 
detta1|mx2Hmy2)>«rgtHimp(yy1(mx2Kmy2])j} 


I 

/*  Tbs  luncMn  updates  the  vveights  between  L2  and  LI  V 

void  tiain_W2(ini_iime.deoquanteps1  .eps2) 

ntdecquant, 

lloatapBl,ep62: 

long  int  int_time, 

I 

int  mx2,my2,pixval; 
long  rtt  klong: 

fgo«(); 

sethdw(SPONN_INPUT), 

fgonO. 


rwiiiayl  toll  obiect*/ 

tor(my2-0:my2<ymax1  ;my24'*){ 
toftmx2-0;mx2<xmax1  :mx2'f'f){ 
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if(deoquant) 

pixval>  1 27*hsgn(yy  1  (mx2Jniy2).eps  1 ): 
else 

pixval>  1 27*yy  1  (mx2](my2]. 

block|xvi_obj|2*mx2).yin_obi[2*iTiy2-)-yinaxU  1  ).8ae4ize.  1 27-^pixval) : 
block(  xin_ot)j(2’mx24'  1  ],yin_obi[2*niy2«ymax  1  *  1  ),si2e,size,  1 27-p(xval) : 

I) 

/*  WniB  delta2  to  L2  object  */ 

for(my2»0;my2<ytnax1  :my2+-f){ 
tor(mx2«0;inx2<xmax1  ;mx2-«"f)( 
if(deoquant) 

pixvalsl27*h6gn<delta2(mx2Imy2].ep62): 

else 

plxval>i  1 27*dalta2(mx2I(ny2] ; 

block(xin_obj{2*mx2].yin_ot)|2*my2-»^ymax  1  ].size.si2e.  127-fp<xval); 
blocl((xin_obj(2*mx2+ 1  ].yin_ot)j(2’my2-^ymax  1  ).si2e^ize.  1 27-plxval): 

I) 

r  Whie  y1  to  Li  rateronoe  */ 

fof(my2«0:my2<ymax1  :my2++){ 

Ioftmx2>i0;mx2<xinax1  ;mx2-f-f)| 
if(dea)uant) 

p)xval>  1 27*hsgn(yy  1  (mx2Imy2).ep8 1 ) ; 
else 

pixval- 1 27’yy  1  (mx21fmy21; 

btodt(xin_rel{2*mx2].yin_re((2’my2-f  ymax  1  ).slze.size.  1 27+ptxval) ; 
block(xin_rell2'mx2+ 1  ),yin_rBfl2*my2+ymax  1  l.sae.size.  1 27'pixval); 

II 

Write  dBltB2  to  L2  relererce  ’/ 

tor(my2a0:my2<ymax1  :my2'f-f){ 
tortmx2-0;mx2<xm«x1  :mx2++)| 
ii(dBoquant) 

p)xval>127*hBgr«<dBltB2(mx2Imy2|.eps2); 

else 

pixval>  1 27’d8itB2(mx2Imy  2] . 

blocfc(xin_reil2*mx2].yin_rBl(2*iTiy2-^ymax  1 + 1  j.size.size.  1 27+ptxval); 
bloclt<xin_rell2*mx2+ 1 1,yin_rell2*my2+ymax  1  ♦  1  j.size.size.  1 27-pixval) : 

II  . 

r  Adjust  L1-L2  and  L2-L1  weights  */ 

o«jlp(beBe1. WRITE); 

lor(klongaO:ldong<>int_titne:ldong++); 

outp(baae1.%AO); 

block(0.ymax  1  *size.arraydiffl.ymax2*size.O). 

I 
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r  Tbs  luncoon  updates  the  weighisbeAmenLO  and  LI  */ 


vokt  train_W1(int_time,decquanLep62.mx,my) 
long  int  in(_tffle: 
mt  mx.my.deoquant, 
noatep62; 

I 

int  nix2.my2.xbaBe,yi>ase.ptxvat: 
long  nt  klong,k/al; 

•go«(): 

sethdMr(SPONNJNPUT); 

»9on(): 

r  Oisptay  LO  exemplar  */ 

disp_call(  mx  .my -t- 1 ) ; 

r  Write  deltat  to  L1  ob|ecl  */ 

xbasesmx'arraydim; 
y  base>(my  + 1  )*arraydim ; 

for(my2«0:my2<ymax1  :my2+-i')( 
tor(mx2>0;mx2<xmax1  ;mx2-)-4)| 
il(d8oquant) 

pixval>127*hBgn(delta1(mx2|my2].eps2): 

else 

pixvala  1 27’delta  1  (mx2Imy  2) ; 

btoc*<xbaBe+xin_obj(2’mx2),yt)ase+yin_obj(my2),si2e,sae.127*pixval); 
bloek(xba8e+xin_objJ2'mx2+ 1  l,ybese-»yin_oti|{my2].size.size.  1 27-pixval) ; 
I) 

/*  Ad|ust  LO-L1  weights  */ 


lval»2*int_Sme, 

oulp< beset  .WRITE), 

k>r(  Mong  aO :  Klong  <  >lval  :Uong  « . 

outp(base1  READ). 

blocfc(xbase.ybase.arraydim2.ymex  1  'size.O) . 


f  Funcaon  to  change  sign  ol  LI  rat  during  readout  ’/ 

voKt  change_sign_L1() 

I 

Int  mx2.my2.pixval: 


saMw(SPONN_INPUT); 

•90n(), 
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tor(my2=0;my2<ymax1  ;my2++){ 
tor(mx2=0:mx2<xmax1  :mx2'»-^)| 
pixval*- 1 27’yy  1  |aix21my2] ; 

biock(xin_rell2*mx2],yin_re(I2*my2-fymax1],sizeMe.127+pixval); 

block(xin_re(l2*mx24^1],yin_iBi(2*my2'fymax1],size,stze,127'Ptxvai); 

)} 

1 

r  Functon  ID  change  sign  of  L2  ret.  during  readout  */ 
void  changejugn_L2() 

I 

int  mx2.my2,pixval: 

•goffO; 

sethdie(SPONNJNPUT); 

fgonO; 

for(my2>0:my2<ymax1  :my2-t'+){ 
tor(mx2«0:mx2<xmax1  :mx2+4')( 
pixval-'1 27'deita2(mx2|my2] ; 

biock(xin_ref[2*mx2],yin_ref(2*my2-f  ymax  1 + 1  ].si2e,si2e.  1 27+^xvai): 
t)lock(xin_ref|2*mx2+ 1 1,yin_fell2*niy2+ytnax1  ♦  1  J,si2e,si2e.  1 27-pixval) ; 


I 

r  Neuron  sigmoKlal  activation  function  V 

float  tsigmoffsuni) 
float  Isum: 

( 

float  Ival: 

lval-2.(y(  1  +exp(-tgafTinia'fsum))- 1.0; 
retum(lval); 

I 


/*  Backpropagam  function  */ 


float  ghurnpftsum) 
float  fsum; 

{ 

float  fval; 

fval>(  1 .0-fsufn)*(  1  .&ffsuin): 
retum(lval): 

1 


r  Saluralon  function  V 


int  saturffsum) 
float  fsum; 

I 

int  ival; 
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if(tsum>s255) 

ivai>2S5; 

etee 

ivaUlsum: 

retum<ival): 

) 

r  Tnnary  quantization  (unction  */ 

nt  hsgn(net,ep6) 
float  neteps: 

( 

intivai; 

it(net<-eps) 

ival»-1, 

else  if(net>ep6) 
ival=1 : 
else 
ivalsiK}; 

retum(ival); 


r  This  subroutine  generates  a  ( 1 ,0)  LO  rarxtom  exemplar  and  stores 
It  in  a  subregion  of  frame  memory  */ 

void  exemp(ar_rBrKtom_LO(m,n) 
int  m.n. 

I 

int  mx2.my2.xba5e.ybase.pixval; 

•goffO; 

sothdiiv(SRDNN_INPUT): 

>gon{); 

xbasesm  'arraydim . 
ybases(n+ 1  )'arraydim . 

srandfseed-fn’mxrnax-fm-fmmax): 

(or(my2>0:my2<ymax1  :my2'r^){ 

(or(mx2a0;mx2<xmax;mx2-r-^){ 
i((brand()— 1)1 

block(xin_ra(|mx2)4.xbase.y«i_ra(lmy2)-fybese.si2e^e.255).) 

else( 

biock(xin_re(Imx2]>xbase.yin_reffmy2Kyb8se.size.size.O);) 


f  Helufns  1  or  -1  randomly  7 


int  brand() 

I 

retum<2‘geMs<rand(),0, 1  >-1 ): 

I 

r  This  function  g«lsn  bits  10  the  right  of  and  inducing  poGitionp 
in  integer  X.  7 

getfait5(x,p,n) 
unsigned  x.p.n; 

I 

retum((x»(p+1-n))&~(-0«n)); 

I 

r  This  subroulirte  displays  cell  (m.n)  7 

votd  disp_cell(m,n) 
intm.n; 

nt  xbase.ybase, 

<90«<); 

sethdiw(SPONN_INPUT); 

•gonO: 

xbasesm'arraydim: 
ybasesn*  arrayctim : 


panfxbase): 

scroll(ybase); 

) 

/*  Returns  0  it  arguments  have  same  sign.  1  otherwise  7 

float  compare_sign(m.tn) 
int  m. 
float  In. 

I 

float  fval. 

if(m*fn>0) 

fvat>0. 

else 

IvaUt. 

retumffval); 

) 
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Appendix  B:  Publications  and  Presentations 

Publications 

Y.  Owechko,  "Cascaded-Grating  Holography  for  Artificial  Neural  Networks,"  accepted  for 
publication  in  Applied  Optics,  1992. 

Y.  Owechko,  "Optical  Implementation  of  Backpropagation  Neural  Network  Using 
Cascaded-Grating  Holography,"  to  be  submitted  to  International  Journal  of  Optical 
Computing  in  1992  (Invited  Paper). 

Y.  Owechko  and  B.  H.  Soffer,  "A  Programmable  Optical  Neuro-Computer  Based  on 
Photorefractive  Holograms,"  Proceedings  of  Government  Microcircuit  Applications 
Conference,  Las  Vegas,  1992. 

Y.  Owechko  and  B.  H.  Soffer,  "Multi-Layer  Optical  Neural  Networks,"  SPIE  Proceedings 
Vol  1773,  1992. 

Y.  Owechko  and  B.  H.  Soffer,  "An  Optical  Interconnection  Method  for  Neural  Networks 
Using  Self-Pumped  Phase  Conjugate  Mirrors,"  Optics  Letters  16,  675-677  (1991). 

G.  J.  Dunning,  Y.  Owechko,  and  B.H.  Soffer,  "Hybrid  Opto-Electronic  Neural  Networks 
Using  a  Mutually-Pumped  Phase  Conjugate  Mirror,"  Optics  Letters  16, 928-930(1991). 

Y.  Owechko  and  B.  H.  Soffer,  "Optical  Neural  Networks  Based  on  Liquid  Crystal  Light 
Valves  and  Photorefractive  Crystds,"  Proceedings  of  SPIE/SPSE  Symposium  on  Electronic 
Imaging  Science  and  Technology,  San  Jose,  Feb.  1991,  Vol.  1455, 136-144.  (Invited  Paper) 

B.H.  Soffer,  Y.  Owechko,  and  G.J.  Dunning,  "A  Photorefractive  Optical  Neural  Network," 
SPIE  Proceedings  Vol.  1347, 1,  (1990).  (Invited  Paper) 

B.H.  Soffer,  Y.  Owechko,  and  G.J.  Dunning,  "A  Photorefractive  Optical  Neural  Network," 
Proceedings  of  15th  Congress  of  International  Commission  for  Optics,  Aug.  1990, 

Garmish,  Germany. 


Presentations 

Y.  Owechko,  "Programmable  Optical  Neural  Networks,"  OS  A  1992  Annual  Meeting, 
Albuquerque,  1992.  (Invited  TaUc) 

Y.  Owechko  and^.  H.  Soffer,  "A  Programmable  Optical  Neuro-Computer  Based  on 
Photorefractive  Holograms,"  Government  Microcircuit  Applications  (I)onference,  Las 
Vegas,  1992. 

Y.  Owechko,  Lake  Louis  conference  on  electronic  and  optical  implementations  of  neural 
networks,  1992. 
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Y.  Owechko  and  B.  H.  Soffer,  "Multi-Layer  Optical  Neural  Networks,"  SPEE  Conference 
1773  on  Photonics  for  Computers,  Neural  Networks,  and  Memories,  San  Diego,  1992. 

Y.  Owechko  and  B.  H.  Soffer,  "Recent  Advances  in  Optical  Neural  Networks  Using  Beam 
Fanning  in  Photoreffactive  Crystals,"  LEOS  '91,  San  Jose,  Nov.  1991.  (Invited  Talk) 

Y.  Owechko  and  B.  H.  Soffer,  "Optical  Neural  Networks  Based  on  Liquid  Crystal  Light 
Valves  and  Photorefractive  CrystaJs,"  SPEE/SPSE  Symposium  on  Electronic  Imaging 
Science  and  Technology,  San  Jose,  Feb.  1991.  (Invited  Talk) 

Y.  Owechko,  G.  Dunning,  and  B.  Soffer,  "Optical  Neural  Networks  Based  on  Stimulated 
Photorefractive  Effects,"  OSA  1990  Annual  Meeting,  Boston.  (Invited  Talk) 

Y.  Owechko,  "Holographic  Neural  Networks,"  1990  International  Topical  Meeting  on 
Optical  Computing,  Kobe,  Japan.  (Invited  Talk) 

Y.  Owechko,  B.H.  Soffer,  and  G.J.  Dunning,  "Photorefractive  Optical  Neural  Networks," 
International  Joint  Conference  on  Neural  Networks,  Washington,  D.C.,  1990.  (Invited 
Talk) 

G.J.  Dunning  and  Y.  Owechko,  "Multi-Pon  Optical  Neural  Network  Using  a  Mutually 
Pumped  Phase  Conjugate  Mirror,"  IEEE  Meeting  on  Nonlinear  Optics:  Materials, 
Phenomena,  and  Devices,  Kuau,  1990. 

G.J.  Dunning,  Y.  Owechko,  and  B.H.  Soffer,  "Optical  Neural  Network  Using  a  Mutually 
Pumped  Phase  Conjugate  Mirror,"  OSA  Annual  Meeting,  Orlando,  1989. 

Y.  Owechko,  "Self-Pumped  Optical  Neural  Networks,"  OSA  Topical  Meeting  on  Optical 
Computing,  Salt  Lake  City,  1989. 

Y.  Owechko,  "Stimulated  Photorefractive  Optical  Neural  Networks,"  Third  Annual  Parallel 
Processing  Symposium,  Cal.  State  Fullerton,  1989.  (Invited  Talk) 
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